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Preface 


This textbook is intended to serve a course on mathematical methods of physics that is 
often taken by graduate students in their first semester or by undergraduates in their senior 
year. I believe the most important topic for first-year graduate students in physics is the 
theory of analytic functions. Some students may have had a brief exposure to that subject 
as undergraduates, but few are adequately prepared to apply such methods to physics prob- 
lems. Therefore, I start with the theory of analytic functions and practically all subsequent 
material is based upon it. The primary topics include: theory of analytic functions, integral 
transforms, generalized functions, eigenfunction expansions, Green functions, boundary- 
value problems, and group theory. This course is designed to prepare students for advanced 
treatments of electromagnetic theory and quantum mechanics, but the methods and appli- 
cations are more general. Although this is a fairly standard course taught in most major 
universities, I was not satisfied with the available textbooks. Some popular but encyclope- 
dic books include a broader range of topics, much too broad to cover in one semester at 
the depth that I thought necessary for graduate students. Others with a more manageable 
length appear to be targeted primarily at undergraduates and relegate to appendices some 
of the topics that I believe to be most important. Therefore, I soon found that preparation 
of lecture notes for distribution to students was evolving into a textbook-writing project. 

I was not able to avoid producing too much material either. I usually chose to skip 
most of the chapter on Legendre and Bessel functions, assuming that graduate students 
already had some familiarity with them, and instead referred them to a summary of the 
properties that are useful for the chapter on boundary-value problems. Other instructors 
might choose to omit the chapter on dispersion theory instead because most of it will 
probably be covered in the subsequent course on electromagnetism, but I find that subject 
more interesting and more fun to discuss than special functions. The chapter on group 
theory was prepared at the request of reviewers; although I never reached that topic in one 
semester, I hope that it will be useful for those teaching a two-semester course or as a 
resource that students will use later on. It may also be useful for one-semester courses at 
institutions where the average student already has a sufficiently strong mastery of analytic 
functions that the first couple of chapters can be abbreviated or omitted. I believe that 
it should be possible to cover most of the remaining material well in a single semester 
at any mid-level university. I assume that the calculus of variations will be covered in a 
concurrent course on classical mechanics and that the students are already comfortable 
with linear algebra, differential equations, and vector calculus. Probability theory, tensor 
analysis, and differential geometry are omitted. 

A CD containing detailed solutions to all of the problems is available to instructors. 
These solutions often employ MATHEMATICA’ to perform some of the routine but tedious 
manipulations and to prepare figures. Some of these solutions may also be presented as 
additional examples of the techniques covered in this course. 
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Note to the Reader 


I chose to prepare my lecture notes and subsequent textbook using MATHEMATICA’ because 
I am very enamored of its facility for combining mathematical typesetting with symbolic 
manipulation, numerical computation, and graphics into notebook documents approaching 
publication quality. However, because students must learn the mathematical techniques in 
this course, not just the syntax of a program, practically all derivations in the body of the 
text are performed by hand with MATHEMATICA’ serving primarily as a word processor. 
The figures were also produced using MATHEMATICA’ , but most of the code for the figures 
has been removed from the main text. And, of course, I often checked my work using the 
symbolic manipulation tools of the program. However, I also discovered a disturbingly 
large number of integrals that MATHEMATICA evaluated incorrectly. Some of those errors 
have been corrected in later versions, perhaps due in part to my error reports, but inevitably 
new errors emerged even for integrals that were evaluated correctly in earlier versions! The 
lessons that students should learn from my experience is either caveat emptor (let the buyer 
beware) and trust but verify. The student must understand the mathematics well enough 
to recognize probable errors (the smell test) and to check the results of any mathematical 
software. Software is helpful, but no software is perfect! The wetware between your ears 
must evaluate the results of the software. 

I also adopted some of the notation of MATHEMATICA because it is often superior to 
the traditional notation of mathematical literature. For example, f[x] with square brack- 
ets indicates a function f whose argument is x while f(x) with parentheses indicates the 
product f « x. Although the target audience would rarely confuse delimiters intended for 
grouping with delimiters intended for arguments, anyone who has taught lower-level cour- 
ses has witnessed the havoc wrought by ambiguous notations. Therefore, I have gotten 
into the habit of using parentheses only for grouping terms, square brackets primarily for 
arguments (and commutators), and curly brackets for lists or iterators. Similarly, I use 
MATHEMATICA ’s double-struck symbols z for =I, e for the base of the natural loga- 
rithm, dfor differential, etc. Furthermore, I often distinguish between assignments (=) and 
equations (==) intended to be solved. I hope that most readers eventually agree that some 
of these nontraditional typesetting practices are actually preferable to traditional notation. 
Finally, I use several convenient acronyms: wrt for with respect to, rhs for right-hand side, 
lhs for left-hand side, and iff for if and only if. 

I encourage students to use mathematical software to perform some of the mundane 
tasks encountered in homework problems and to plot their solutions. Several examples 
are given in the text, where MATHEMATICA’ code is sometimes used to perform simple but 
tedious algebraic manipulations. However, software should not be used to circumvent the 
object of an exercise. For example, if the objective of a problem is to practice an integra- 
tion method, then simply quoting MATHEMATICA”s answer is not sufficient and sometimes 
would even be incorrect. It should be obvious when computer assistance is appropriate 
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XVI Note to the Reader 


and when it is not. Also, please take care to specify the assumptions made in the solu- 
tion of problems, carefully identifying the range of validity with respect to any parameters 
present. 

The student CD that accompanies this book includes the original MATHEMATICA’ note- 
book files used to prepare the text and supplementary notebooks that provide solutions to 
selected exercises, the code used to prepare figures, and some additional material where 
appropriate. We also provide a basic introduction to MATHEMATICA’. 


1 Analytic Functions 


Abstract. We introduce the theory of functions of a complex variable. Many familiar 
functions of real variables become multivalued when extended to complex variables, 
requiring branch cuts to establish single-valued definitions. The requirements for 
differentiability are developed and the properties of analytic functions are explored 
in some detail. The Cauchy integral formula facilitates development of power series 
and provides powerful new methods of integration. 


1.1 Complex Numbers 
1.1.1 Motivation and Definitions 


The definition of complex numbers can be motivated by the need to find solutions to 
polynomial equations. The simplest example of a polynomial equation without solutions 
among the real numbers is z? == —1. Gauss demonstrated that by defining two solutions 
according to 


? = -] =z = Łi (1.1) 
one can prove that any polynomial equation of degree n has n solutions among complex 
numbers of the form z = x + iy where x and y are real and where i? = —1. This powerful 
result is now known as the fundamental theorem of algebra. The object i is described as 
an imaginary number because it is not a real number, just as V2 is an irrational number 
because it is not a rational number. A number that may have both real and imaginary com- 
ponents, even if either vanishes, is described as complex because it has two parts. Through- 
out this course we will discover that the rich properties of functions of complex variables 
provide an amazing arsenal of weapons to attack problems in mathematical physics. 

The complex numbers can be represented as ordered pairs of real numbers z = (x, y) 
that strongly resemble the Cartesian coordinates of a point in the plane. Thus, if we treat 
the numbers 1 = (1, 0) and i = (0, 1) as basis vectors, the complex numbers z = (x, y) = 
xx1+yxX1i= x+ ly can be represented as points in the complex plane, as indicated in 
Fig. 1.1. A diagram of this type is often called an Argand diagram. It is useful to define 
functions Re or Im that retrieve the real part x = Re[z] or the imaginary part y = Im[z] 
of a complex number. Similarly, the modulus, r, and phase, 6, can be defined as the polar 
coordinates 


r= ye +y% O= AreTan|*| (1.2) 
X 


by analogy with two-dimensional vectors. 
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x=Re[z] 


Figure 1.1. Cartesian and polar representations of complex numbers. 


y=Im[z] 


x=Relz] 


Figure 1.2. Addition of complex numbers. 


Continuing this analogy, we also define the addition of complex numbers by adding 
their components, such that 


z tz = (x, +X, +) 27 +2 =(x, +2) x1+ (y, +12) xi (1.3) 


as diagrammed in Fig. 1.2. The complex numbers then form a linear vector space and 
addition of complex numbers can be performed graphically in exactly the same manner as 
for vectors in a plane. 

However, the analogy with Cartesian coordinates is not complete and does not extend 
to multiplication. The multiplication of two complex numbers is based upon the distribu- 
tive property of multiplication 


2129 = (x; + iyi), + 1y,) 
= XX) + Py yy + iY, + x31) (1.4) 


= (XX, — Y1Y2) + HAY. + X2J1) 
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y=Im[z] 


z=(x y) 


= x=Relz] 
O+r 


2=(-x,- y) Z=(x,-y) 


Figure 1.3. Inversion and complex conjugation of a complex number. 


and the definition i? = —1. The product of two complex numbers is then another complex 
number with the components 


ZZ = (xx, = Y1Y2 *1Y2 +x)y1) (1.5) 


More formally, the complex numbers can be represented as ordered pairs of real numbers 
z = (x, y) with equality, addition, and multiplication defined by: 


Zp = % =X, =X Ay, =), (1.6) 
z +z = (xı +41 + y) (1.7) 
Za XZ% = (xx = Vo» X Yo + XY) (1.8) 


One can show that these definitions fulfill all the formal requirements of a field, and we 
denote the complex number field as C. Thus, the field of real numbers is contained as a 
subset, R c C. 

It will also be useful to define complex conjugation 


complex conjugation: z = (x, y) => z* = (x, —y) (1.9) 
and absolute value functions 
absolute value: |z| = yx? + y (1.10) 


with conventional notations. Geometrically, complex conjugation represents reflection 
across the real axis, as sketched in Fig. 1.3. 
The Re, Im, and Abs functions can now be expressed as 


Z+Zz Vie 
= Im[z] = 


R 
el] = 5 2 


lz? = zz (1.11) 
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Thus, we quickly obtain the following arithmetic facts: 


i=(01) 2=-1 P=-i ïř=1 
scalar multiplication: c e R => cz = (cx, cy) 


additive inverse: z = (x, y) => —z = (-x, -y) = z+ (-z) =0 (1.12) 
a _ 1 x-i Zz 

multiplicative inverse: z™! = => y z5 
x+iy x+y kl 


1.1.2 Triangle Inequalities 


Distances between points in the complex plane are calculated using a metric function. 
A metric d|a, b] is a real-valued function such that 


dla, b] > 0 for all a + b 
2. dla, b] = 0 for all a = b 
3. dla, b] = d[b, a] 
4. dla, b] < dla, c] + d[c, b] for any c. 


Thus, the Euclidean metric d[z,, z3] = lz; — zəl = y (x, — 12)? + (y, — y? is suitable for 
C. Then with geometric reasoning one easily obtains the triangle inequalities: 


triangle inequalities : |lz,1 — lz,Il < Iz, + zəl < lz,1 + lz (1.13) 


Note that C cannot be ordered (it is not possible to define < properly). 


1.1.3 Polar Representation 


The function e* can be evaluated using the power series 


io _ O Oy" 2n gent 
e 


ar =) roo + i ‘ie Y rem 7 Coslél + ¿Sin(8) (1.14) 


giving a result known as Euler’s formula. Thus, we can represent complex numbers in 
polar form according to 


n= 


z = re? = x = rCos[0], (1.15) 
y=rSin[ð] with r= |z| = Je +y and 0=arglz] (1.16) 


where r is the modulus or magnitude and 6 is the phase or argument of z. Although addition 
of complex numbers is easier with the Cartesian representation, multiplication is usually 


1.1 Complex Numbers 5 


easier using polar notation where the product of two complex numbers becomes 


2,22 = r¡r¿(Cos[8, ] + i Sin[9, ])(Cos[6,] + i Sin[6,]) 
= r,r,(Cos[0,] Cos[6,] — Sin[, ] Sin[9,] + ¿(Sin[9, ] Cos[6,] + Cos[6,] Sin[9,])) 
= r,r,(Cos[0, + 0,] + ¿Sin[0, + 0,]) 

=r, r,ece1+ón 


(1.17) 


Thus, the moduli multiply while the phases add. Note that in this derivation we did not 
assume that e7! e? = e+, which we have not yet proven for complex arguments, relying 
instead upon the Euler formula and established properties for trigonometric functions of 
real variables. 

Using the polar representation, it also becomes trivial to prove de Moivre’s theorem 


(e®)" = e"? => (Cos[6] + iSin[9])” = Cos[n6] + iSin[n6] for integer n. (1.18) 


However, one must be careful in performing calculations of this type. For example, one 
cannot simply replace (e*"9)!/7 by el? because the equation, z” == w has n solutions {z,, k = 
1, n} while e”? isa unique complex number. Thus, there are n, n™-roots of unity, obtained 
as follows. 


z= re? =z = rein (1.19) 
Z=1=>r=1, n0=2kx (1.20) 
2nk 2nk 27k 
az= Expla] = Cos| ==] + ¿sin | for k=0,1,2,....n-1 (1.21) 
n n n 


In the Argand plane, these roots are found at the vertices of a regular n-sided polygon 
inscribed within the unit circle with the principal root at z = 1. More generally, the roots 


27k 
2 =w= pe” = q =p!" Expl io | for k=0 12... 1-1 (1.22) 
n 


of e'? are found at the vertices of a rotated polygon inscribed within the unit circle, as 
illustrated in Fig. 1.4. 


1.1.4 Argument Function 
The graphical representation of complex numbers suggests that we should obtain the phase 
using 


9 y 
6 = arctan | (1.23) 
x 


but this definition is unsatisfactory because the ratio y/x is not sensitive to the quadrant, 
being positive in both first and third and negative in both second and fourth quadrants. Con- 
sequently, computer programs using arctan[*] return values limited to the range (—§, 5). 
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Figure 1.4. Solid: 6th roots of 1, dashed: 6th roots of e*?, 


A better definition is provided by a quadrant-sensitive extension of the usual arctangent 
function 


ArcTan[x, y] = ArcTan B + =a — Sign[x]) Sign[y] (1.24) 


that returns values in the range (—z, 7). (Unfortunately, the order of the arguments is 
reversed between Fortran and Mathematica.) Therefore, we define the principal branch 
of the argument function by 


Arg[z] = Arg[x + iy] = ArcTan[x, y] (1.25) 


where —7 < Arg[z] < 7. 
However, the polar representation of complex numbers is not unique because the phase 0 
is only defined modulo 27. Thus, 


arg[z] = Arg[z] + 27n (1.26) 


is a multivalued function where n is an arbitrary integer. Note that some authors distinguish 
between these functions by using lower case for the multivalued and upper case for the 
single-valued version while others rely on context. Consider two points on opposite sides 
of the negative real axis, with y > 0* infinitesimally above and y > 0” infinitesimally 
below. Although these points are very close together, Arg[z] changes by 27 across the 
negative real axis. 

A discontinuity of this type is usually represented by a branch cut. Imagine that the 
complex plane is a sheet of paper upon which axes are drawn. Starting at the point (r, 0) one 


can reach any point z = (x, y) by drawing a continuous circular arc of radius r = yx? + y? 
and we define arg[z] as the angle subtended by that arc. This function is multivalued 
because the circular arc can be traversed in either direction or can wind around the origin an 
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e Argl(x,0%)]=+1 
° Arg[(x,0-)]=-7 


Figure 1.5. Branch cut for Arg[z]. 


arbitrary number of times before stopping at its destination. A single-valued version can be 
created by making a cut infinitesimally below the negative real axis, as sketched in Fig. 1.5, 
that prevents a continuous arc from subtending more than +7 radians. Points on the nega- 
tive real axis are reached by positive (counterclockwise) arcs with Arg[(—lzl, 0)] = 7 while 
points infinitesimally below the negative real axis can only be reached by negative arcs 
with Arg[(—lzl, 07)] > —7. Thus, Arg[z] is single-valued and is continuous on any path 
that does not cross its branch cut, but is discontinuous across the cut. 

The principal branch of the argument function is defined by the restriction -m < 
Arg[z] < 7. Notice that one side of this range is open, represented by <, while the other 
side is closed, represented by <. This notation indicates that the cut is infinitesimally below 
the negative real axis, such that the argument for negative real numbers is 7, not —7. This 
choice is not unique, but is the nearly universal convention for the argument and many 
related functions. The distinction between < and < many seem to be nitpicking, but atten- 
tion to such details is often important in performing accurate derivations and calculations 
with functions of complex variables. 

Many functions require one or more branch cuts to establish single-valued definitions; 
in fact, handling either the multivaluedness of functions of complex variables or the dis- 
continuities associated with their single-valued manifestations is often the most difficult 
problem encountered in complex analysis. Although our choice of branch cut for Arg[z] 
is not unique (any radial cut from the origin to co would serve the same purpose), it is 
consistent with the customary definitions of ArcTan, Log, and other elementary func- 
tions to be discussed in more detail later. The single-valued version of a function that 
is most common is described as its principal branch. For many functions there is con- 
siderable flexibility in the choice of branch cut and we are free to make the most conve- 
nient choice, provided that we maintain that choice throughout the problem. For example, 
in some applications it might prove convenient to define an argument function with the 
range -3 < MyArg[z] < z using the branch cut shown in Fig. 1.6. Consider the point 
zı = (-1, —1) for which the standard argument function gives Arg[z,] = -37/4 while our 
new argument function gives MyArg[z;] = 57/4. These functions are obviously different 
because the same input gives different output, but both represent precisely the same ray in 
the complex plane. Therefore, we should consider the specification of the branch cuts as 
an important part of the definition of a single-valued function and recognize that different 
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Figure 1.6. Branch cut for MyArg. 


choices of cuts lead to related but different functions. 

It is important to recognize that, because of discontinuities across branch cuts, simple 
algebraic relationships that apply to multivalent functions of complex variables, often do 
not pertain to their monovalent cousins. For example, using the polar representation of the 
product of two complex numbers we find 


2,22 = rır, ExpliO, + 0,)] = lz,221 = lzıllz2l arg[zız2] = arglz,] + arglzo] (1.27) 
but this relationship for the phase does not necessarily apply to the principal branch 
because 

Arglz,z,] = arglz,z,] + 2an 

= arg[z,] + arglz,] — 27n (1.28) 
= Arg[z,] + Arglz,] + 2a(n — ny — ny) 


where n must be chosen to ensure that —7 < Arg[z,z,] < 7. Often the price of single- 
valuedness is the awkwardness of discontinuities. 


1.2 Take Care with Multivalued Functions 


Ambiguities in the definitions of many seemingly innocuous functions require consider- 
able care. For example, consider the common replacement 


1 
AE eo? (1.29) 


that one often makes without thinking. Is this apparent equivalence correct? Compare the 
following two methods for evaluating these quantities when z > —1. 


natant (1.30) 


z= e" =>1M=g"?= (1.31) 
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Both calculations look correct, but their results differ in sign. These expressions are not 
always interchangeable! One must take more care with multivalued functions. 

If we represent the complex number z in Cartesian form z = x + iy where x, y are real, 
then 

1 1 x- ly 


= = 1.32 
z oxtiy x+y a 


If x < 0 and y > e where e is a positive infinitesimal, then z is just above and z7! is 


just below the usual cut in the square-root function (below the negative real axis). Conse- 
quently, Arg[z] and Arg[z7*] differ by 277 and the arguments of Y1/z and z~! differ by 
m, a negative sign, in the immediate vicinity of the negative real axis. It is usually not a 
good idea to use the surd (square-root) symbol for complex variables — for real numbers 
that symbol is usually interpreted as the positive square root, but for negative or complex 
numbers we should employ a fractional power and define the branch cut explicitly. Then, if 
we define —z < Arg[z] < x with a cut infinitesimally below the negative real axis the same 
cut would be implied for fractional powers and the value of z7'? determined using polar 
notation would be unambiguous on the negative real axis. Furthermore, 1/22 = 712 
applies everywhere in the cut z-plane without the sign ambiguity encountered above. Of 
course, the sign discontinuity across the cut is still present — it is an essential feature of 
such functions. 

Let us examine the square-root function, w = f[z] = 22, in more detail. When z is a 
positive real number, the square-root function maps one z onto two values of w = ++/x. 
Similar behavior is expected for complex z because there are always two solutions to the 
quadratic equation w? == z. In polar notation 


0 
z=re®, z=wW =w= vrExp|i, + nai (1.33) 


where, by convention, yr represents the positive square root for real numbers and where 
n = 0, 1 yields two distinct possibilities. Thus, the image of one point in the z-plane is two 
points in the w-plane. If we define w = u + iv, the component functions u[x, y] and v[x, y] 
can be obtained by solving the equations 


x== 042 — y? y == 2uv (1.34) 
Substituting v> y/2u and solving the quadratic equation for u?, we find 


riel + y yX +y +x mas 


40 — 4u?x -y = 0 = Ww = M aM’ 


where the positive root is required in u? to ensure real u. Then, solving for v and rational- 
izing the expression under the square root, we obtain 


(1.36) 
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Figure 1.7. Real and imaginary components of z!/?. 


rei] 


Argl 
$ 
¡9 


Figure 1.8. Dependence of Arg[z!"?] upon polar angle. 


Finally, taking the positive root of y?, we obtain 


(1.37) 


where the relative sign between u and v is determined by the sign of y. Note that there 
are only two, not four, solutions. The principal branches of the component functions are 
plotted in Fig. 1.7, where it is customary, though arbitrary, to select the positive branch of 
u so that positive square roots are obtained on the positive real axis. Similar figures are 
obtained for other positive nonintegral powers, rational or irrational. 

Notice that v = Im[yz] is discontinuous on the negative real axis. The real part is 
continuous, but its derivative with respect to y is discontinuous on the negative real axis. 
Consider the image of a circular path z = re’, 0 < 6 < 2m under the mapping w = yz. 
The argument of w changes abruptly from 7 to —7 as the negative real axis is crossed from 
above, as sketched in Fig. 1.8. 

In order to define a well-behaved monovalent function, we must include in the defi- 
nition of f a rule for selecting the appropriate output value when the mapping z > w is 
multivalent. The customary solution is to introduce a branch cut along the negative real 
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axis by restricting the range of the argument of z to —7 < 0 < m and agreeing not to cross 
the cut in the z-plane. Thus, Sqrt and Arg employ the same branch cut, shown in Fig. 1.5. 
We imagine that the cut is infinitesimally below the negative real axis so that the argument 
of negative real numbers is 7 and x < 0 => yx = iylx]. The discontinuity in Arg[z”] 
across the branch cut depends upon a. The end points of the branch cut are known as 
branch points at which discontinuities first open. Here, the most important branch point is 
at z = 0, but one often says that there is also a branch point at co. This somewhat sloppy 
language means that for large |z| the branch cut is parametrized by z = Re% with R > oo, 
but the choice of 0 remains arbitrary; here we happened to choose 6 = 7. 

Next consider the slightly more complicated function 

fid = (2-1)? =(@-D@e+ 0)” (1.38) 
Our experience with the square root suggests that we must pay close attention to the points 
z = +1 that are the branch points for (z + 1)'?. By factoring the argument of the square 
root and choosing ranges for the phase of each factor according to 


a, -n<0 sr (1.39) 
a, -n< sr (1.40) 


z=z-l=re’ 
% =e+1ane 


we obtain a single-valued version defined by 


filel= rre], ke 10.0 (1.41) 


and the branch cuts indicated in Fig. 1.9. The principal branch is defined, somewhat arbi- 
trarily, by k = 0 because that gives a positive root for z on the real axis with x > 1. The two 
heavy points show the branch points in the two factors (z + 1)'/? and the lines anchored 
by those points show the associated branch cuts. Here we decided to draw both branch 
cuts to the left, as is customary for Arg[z] or z'?, such that both polar angles are defined 
in the range —7 < 6,, < 7. Since it is clear that any discontinuities will be found along 
the real axis where the two phases may be discontinuous, the structure of the function can 
be investigated using strategically chosen points on both sides of the real axis labeled a—f 
in the figure and its accompanying Table 1.1. This table shows that f, [z] is discontinuous 
across the portion of real axis between the two branch points, namely —1 < x < 1, but 
is continuous elsewhere. In effect, the overlapping branch cuts cancel each other in the 
region x < —1 because, at least for this function, the discontinuity is simply a sign change; 
however, the behavior of overlapping cuts is not always this simple. 
Alternatively, if we define the phases according to 


yz =z-l=re%, 0<6,<2n (1.42) 
z=rztleane”, -n< <r (1.43) 


we obtain another version 


hkl] = Vrn Expl (2A + kr). ke (0, 1) (1.44) 


. ; 3 Ds ] i ; ; 2 ; 
in which the cut in z1” is now directed toward the right while the cut in 22 remains to the 
left; these cuts are illustrated in Fig. 1.10. The same selection of trial points now produces 
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Table 1.1. Selected values of f, [z] defined by Eq. (1.41). 


Point x y 6, 0, fl 
a x>l e 0 0 ve-1 
b x>l -e 0 0 We=1 
È -l<x<l E m 0 iV1-x 
d =1<x<l -£ -r 0 -iyl -x 
e x<-l e m m -Væ -1 
f x<-l -e -r -r -V2-l 
y 
1/2 
z—1 
=l) c 0; a 
x 
d 
(z+ 1)! 


Figure 1.9. Branch cuts for f, [z] defined by Eq. (1.41). 


Table 1.2 that shows that f,[z] is continuous for —1 < x < 1 but is discontinuous every- 
where else on the real axis. Although their algebraic definitions are the same, f,[z] and 
f,[z] are clearly different functions, fraternal twins that are distinguished by their branch 
cuts. The choice of cuts is a fundamental aspect of the definition of a single-valued ver- 
sion of an inherently multivalued function; the definition is not complete until the cuts are 
specified. For any particular application the most appropriate version may depend upon 
other aspects of the problem, such as physical boundary conditions, or may be chosen for 
convenience. If one is most interested in small values of |z| it will probably be more con- 
venient to choose f,[z], but for large values f, [z] is probably preferable, but consistency 
must be maintained through any particular problem. Furthermore, although the two ver- 
sions suggested here are the most common, they do not exhaust the possibilities. 

The moral of this somewhat belabored exercise is that one must be very careful in 
manipulating expressions involving complex variables and avoid making unintentional 
assumptions about the phases of various subexpressions. Most people tend to be very care- 
less with phases, automatically replacing Vs? by s without knowing whether s is positive 
or negative or complex. Similarly, many people complain that MATHEMATICA often does 
not perform simplifications that are perceived to be obvious. The reason for this is that 
MATHEMATICA’ hates to make mistakes and will not make unjustified assumptions about 


1.3 Functions as Mappings 13 


Table 1.2. Selected values of f,[z] defined by Eq. (1.44). 


Point x y 0, 8 ARI 
a x>1l € 0 0 Vi -1 
b x>l -e 2 0 -ve-1 
c -l<x<l e n 0 Wl- 
d -l<x<l -e n 0 ivI-xX 
e x<-l e T m Ve -1 
f x<-l -e m -n V¥-1 
y 
e > c O; a 
x 
f d 


Figure 1.10. Branch cuts for f, [z] defined by Eq. (1.44). 


whether a variable is real or, if it is, about its sign — it assumes that all variables are com- 
plex unless told otherwise. The Simplify function has an option that permits the user to 
specify permissible assumptions, such as one variable is real, another positive, a third a 
negative integer, etc. When you take responsibility for these assumptions, MATHEMATICA 
will usually go much further in simplifying your expressions. Often it still will not reach 
the elegant representation that one might find in a textbook, but its manipulations will be 
correct and that is what matters most. 


1.3 Functions as Mappings 


A function f maps the complex variable z = (x, y) into a complex image w = (u, v) accord- 
ing to rules specified in the definition w = f[z]. Thus, f maps points in the complex z-plane 
onto points in the complex w-plane, a mapping of C > C. For a single-valued function the 
image of a point is a point, but for a multiple-valued function the image of a point may 
be a set of points. For continuous functions, the image of a line segment (arc) in the input 
plane will be one or more arcs in the output plane. Often considerable insight into the prop- 
erties of a function may be obtained by examining the images of coordinate lines (lines 
of constant x or constant y). Below we analyze the mappings produced by some familiar 
functions. 
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8-6-4-2 R 4 6 8 


Figure 1.11. Mapping: u + iv = e**”, Lines of constant y are mapped onto radial lines while lines 
lines of constant x are mapped onto circles in the w-plane. 


1.3.1 Mapping: w = e* 


The exponential function is defined by the mapping 


w = e = e*(Cos[y] + éSin[y]) = u + iv = ulx, y] = e* Cos[y] vix, y] = e* Sin[y] 
(1.45) 


The image of a grid of coordinate lines is sketched in Fig. 1.11. Lines of constant y are 
mapped into radial lines, while lines of constant x are mapped into circles. The origin, with 
x = 0, is mapped onto the unit circle. Increasingly positive x = x) > 0, mapped onto circles 
of exponentially increasing radius e*, and increasingly negative x = x) < 0 mapped onto 
exponentially tighter circles, practically indiscernible in this figure. Thus, the images of 
the coordinate lines remain orthogonal, but the mapping severely distorts distances. 

It is important to recognize that the mapping produced by the exponential function is 
many-to-one because 


Exp[z + 27ik] = Exp[z] for integer k (1.46) 


is periodic, so that infinitely many input points z = z) + 27ik are mapped onto the same 
image point. Thus, any strip |y—yo| < 7 in the z-plane is mapped onto the entire w-plane and 
neighboring strips would replicate the covering of the w-plane. Consequently, the inverse 
function z = log[w] is many-valued because it represents a one-to-many mapping. By 
convention, we define the principal branch of the logarithm function 


Log[z] = Logllzl] + iArg[z] with -%7 < Argiz] < 7 (1.47) 


by limiting the phase y = Arg[w] to the strip —a < y < m by means of a branch cut along 
the negative real axis, as indicated by the thick line in Fig. 1.12. With this convention one 
obtains the following principal values: 

Log[1] =0 Logli] = > Log[-1] = in  Log[-i] = -7 (1.48) 
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Figure 1.12. Mapping: u +iv = Exp[x+iy]. The mapping of any strip ly—y,| < 7 covers the w-plane. 
The branch cut for the inverse mapping is shown in the w-plane. 


One might imagine the w-plane as a circular paper disk with a cut from its edge to the 
center, which prevents a continuous curve to be drawn crossing the cut. A vertical line 
segment in the z-plane between y = -r + € and y = a — e is mapped onto a circular 
arc between Arg[w] = —1 + £ and Arg[w] = a — e. Although the two ends of the arc 
approach other from opposite sides of the branch cut as e > 0*, they differ in phase by 27. 
Therefore, although the branch cut permits the inverse function z = Log[w] to be defined 
as a single-valued mapping w => z, that function is discontinuous across the branch cut. 
Often single-valuedness comes only at the expense of discontinuities. 

Neighboring strips y, = y, + nz simply remap the entire w-plane. One might imagine 
an infinite collection of w-planes, called Riemann sheets, stacked on top of each other such 
that curves which cross the branch cut move from one Riemann sheet to the next. The 
index n then identifies a particular Riemann sheet, with the principal branch represented 
by n = 0. Thus, it is useful to distinguish between a multivalued log and a single-valued 
Log defined by 


w == & = > z= log[w] = log[lwl] + i arg[w] (1.49) 
= log[|wl] + i Arg[w] + 2zin = Log[w] + 2zin 


As a curve winds around the origin of the w-plane in a counterclockwise sense, the 
argument increases continuously and each time one crosses the branch cut one moves 
from one sheet to the next and increments the winding number n by one unit. Clockwise 
winding decrements n, which is permitted to be negative also. Furthermore, the choice 
Yo = 0 is not unique and other choices would rotate the branch cut in the w-plane. The 
single-valued function produced by the most common choice of branch cut is described as 
the principal branch, but for some problems it may become convenient to make a different 
choice. However, the branch cuts used for Arg, Log, and related functions are correlated 
and must be chosen consistently throughout a particular calculation. 

Physics calculations normally must produce a unique answer that can be compared 
with a measurable quantity, such that physical functions must be based upon single-valued 
functions. Similarly, if one is to compute the value of an expression using a computer 
program, there must be a unique result. Numerical methods cannot tolerate multivalued 
expressions — the programmer must provide an unambiguous prescription for selecting the 


16 1 Analytic Functions 


appropriate branches of multivalued functions; a machine cannot perform that job for you. 
It is useful to visualize a function as a machine. When you supply appropriate input, it 
produces a definite and predictable output. A function is not really defined until its branch 
cuts and its discontinuities across those cuts are completely specified. Furthermore, there 
is often considerable flexibility in the selection of cuts that can be exploited to simplify 
the problem at hand, one selection for one problem and another for the next. Therefore, 
one must always be aware of the branch cuts used to regularize an inherently multivalued 
function. 


1.3.2 Mapping: w = Sin[z] 
The sine function is extended to complex variables by the definition 

Sini] = ——_*— (1.50) 
Using 


ant Se e~(Cos[x] + i Sin[x]) — e?(Cos[x] — i Sin[x]) 


e+e” ae e? a 
= > Sin[x] + i Cos[x] 
and the familiar definitions 
y -y Yop 
Cosh{y] = E Sinh[y] = £ > (1.52) 


for real variables, the components of the sine function become 
u + iv = Sin[x + iy] = u = Sin[x] Cosh[y] v = Cos[x] Sinh[ y] (1.53) 


The mapping of (x, y) coordinate lines is illustrated in Fig. 1.13. Lines of constant x are 
mapped into hyperbolae while lines of constant y are mapped into confocal ellipses with 
foci at (u,v) = (+1,0). The definition of the inverse mapping z = ArcSin[w] requires 
branch cuts because any strip |x—x,| < 7 is mapped onto the entire w-plane. It is customary 
to map the principal branch of ArcSin onto the strip -f < x < 5, but two choices remain 
for the branch cuts. Recognizing that as |x| > 5 the hyperbolic images of the vertical lines 
collapse upon the real axis, the most common choice is to place the branch cuts along the 
open interval |u| > 1. This choice reduces to the standard definition of ArcSin[x] for real 


arguments in the range |x| < 1. 
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Figure 1.13. Mapping: u + iv = Sin[x + iy]. Lines of constant x are mapped onto hyperbolas while 
lines of constant y are mapped onto confocal ellipses. 


1.4 Elementary Functions and Their Inverses 
1.4.1 Exponential and Logarithm 


Some properties of the exponential are preserved by extension from the real axis to the 
complex plane. For example, using 
ee? = e e” (Cos[y,] + i Sin[y;]) (Cos[y,] + i Sin[y,]) 
= e" (Cos[y,] Cos[y,] — Sin[y, ] Sin[y,] 


she (1.54) 
+i(Sin[y,] Cos[y>] + Cos[y,] Sin[y,]})) 
= e (Cos[y, + y,] + ¿Sin[y, + y,]) 

we find 

ea ga = guta (1.55) 
Similarly one can easily prove 

1 z al mn 

apo Euge (1.56) 

el e” 
and 

(ey =e” for ne Integers (1.57) 
for integer n. However, one must generally assume that 

(E) 4 e? (1.58) 


for arbitrary powers z,. For example, (e*)'/” is multivalued, producing n values for inte- 
ger n, while e*/” represents a unique complex number. 
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We define the multivalent logarithm function w = log[z] in terms of the solutions to 
the equation z == e”, such that 


_-~ pW 
=e’, 


Lt ; ; , (1.59) 
z = llets = log[z] = log[Izl] + iarg[z] = Log[lzl] + i Arg[z] + 2rin 


where the principal value is used for the logarithm of the positive real number |z| and where 
n is an arbitrary integer. Thus, this version of the logarithm function produces infinitely 
many values for any z . Consequently, we cannot simply replace log[e*] by z during calcu- 
lations because 


¿=x+ iy => & = ee” = logje] = Logfile] + i Arg[e*] + 2rin 
=x + i(y + 27m) (1.60) 


=z+2zin 


is ambiguous. By selecting n > 0, we define the single-valued principal branch as 


Loglz] = Log[lzl] + i Argiz] (1.61) 
and obtain 
Logle*] = z (1.62) 


as expected. On the other hand, some functional relationships that pertain to real arguments 
remain true for the multivalent version but are not necessarily true for the principal branch. 
For example, from 


log[z,z,] = logllz, 2,1] + éarg[z,z,] 
= logllz, llz,1] + i(arglz;] + arglz,]) (1.63) 
= logllz,1] + logllz,1] + i(arglz,] + arglz,1) 


we find 

log[z,z,] = loglz,] + log[z,] (1.64) 
but the corresponding relationship for the principal branch 

Log[z,z,] = Log[z,] + Log[z,] + 27in (1.65) 
is more complicated because we must deduce the appropriate n from the phases of z, and 


Z2. 


1.4.2 Powers 


Powers of a complex number are defined by 


2" = Exp|q@log[z]] = Exp|a Log|lzl]] Exp[ia arg[z]] (1.66) 
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and are generally multivalued. In polar notation we may write 
z= re” => z" = Exp]a Log[r]] Exp[ia(@ + 2zn)| (1.67) 


where n is an integer. This definition conforms to simple expectations for rational expo- 
nents and preserves the algebraic relationships 


(eF = Exp|£ log[z"]] = Exp|Ba log[z]] =z (1.68) 
(2,22)" = Exp[elog[z,2,1] = Exp]e(doglz,] + loglz,D] = 2725 (1.69) 


However, these algebraic relationships are generally multivalent. If we use a branch cut for 
the logarithm under the negative real axis, the same branch cut must be used for multivalent 
powers that are complex, nonintegral, or have negative real parts. The discontinuity in the 
argument of z“ across the branch cut is then 


A Arg[z*] = Limit[Arg[x + ie] — Arg[x — ie], e > 0] = Mod[2x0, 27] (1.70) 


If a is rational there are a finite number of Riemann sheets, but for irrational «œ there are 
an infinite number of Riemann sheets. The principal branch is given by 


principal branch: z“ = Exp|a Loglz]] = Exp|a Logllz1]] Expl io Arg{z]| (1.71) 


1.4.3 Trigonometric and Hyperbolic Functions 


Recognizing that 


E e* + po f E e* — gir _ Sin[x] 
Cos[x] = > Sin[x] = zi Tan[x] = Gosid (1.72) 
ee” ; _e-e* B Sinh[x] 
Cosh[x] = > Sinh[x] = > Tanh[x] = Cosh (1.73) 
for x € R, we define 
pt + pe eË — eË Sin[z] 
De = —___ T — 1.74 
Cos[z] > Sin[z] > an[z] Coal (1.74) 
E +e” . E -— e”? Sinh[z] 
Cosh[z] 5 Sinh[z] 5 anh[z] Cola] (1.75) 
for z € C. Inverting these expressions gives 
e” = Cos[z] + i Sin[z] e“ = Cos[z] — i Sin[z] (1.76) 
e€ = Cosh[z] + Sinh[z] e * = Cosh[z] — Sinh[z] (1.77) 
Obviously, 
Cos[-z] = Cos[z] Sin[—z] = — Sin[z] Tan[z] = — Tan[z] (1.78) 


Cosh[-z] = Cosh[z] Sinh[—z] = — Sinh[z] Tanh[z] = — Tanh[z] (1.79) 
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and 


Cos[iz] = Cosh[z] Sin[iz] = ¿Sinh[z] Tan[¿z] = i Tanh[z] (1.80) 
Cosh[iz] = Cos[z] Sinh[¿z] = ¿Sin[z] Tanh[¿z] = i Tan[z] (1.81) 


Multiplying out the expressions 


Expli(z, + 2,)] = Cos[z, + 22] + ¿Sin[z, + 22] 
= (Cos[z,] + ¿Sin[z, )[Cos[z>] + i Sin[z,]) (1.82) 
= epa 

Expli(z, — 2,)] = Cos[z, — z2] + ¿Sin[z, — z2] 
= (Cos[z,] + i Sin[z, ])(Cos[z,] — i Sin[z,]}) (1.83) 


izo 


= eie 


one quickly deduces the addition formulae 


Cos[z, + z,] = Cos[z,] Cos[z,] — Sin[z; ] Sin[z,] (1.84) 
Cosh[z, + z,] = Cosh[z, ] Cosh[z,] + Sinh[z, ] Sinh[z,] (1.85) 
Sin[z, + z2] = Sin[z,] Cos[z,] + Cos[z,] Sin[z,] (1.86) 
Sinh[z, + z,] = Sinh[z,] Cosh[z,] + Cosh[z, ] Sinh[z,] (1.87) 
and 
Cos[z]? + Sin[z]? = 1  Cosh[z]? — Sinh[z]? = 1 (1.88) 


Combining these results, we obtain 


Cos[x + iy] = Cos[x] Cosh[y] — i Sin[x] Sinh[y] (1.89) 
Cosh[x + iy] = Cosh[x] Cos[y] + i Sinh[x] Sin[y] (1.90) 
Sin[x + iy] = Sin[x] Cosh[y] + iCos[x] Sinh[y] (1.91) 
Sinh[x + iy] = Sinh[x] Cos[y] + ¿Cosh[x] Sin[y] (1.92) 


for {x, y) € IR. Expressions for the real and imaginary components of inverse trigonometric 
functions are developed in the exercises. 


1.4.4 Standard Branch Cuts 


Although there is often some flexibility in the choice of branch cuts, the cuts for related 
functions are correlated. Table 1.3 lists the standard choices for elementary functions, but 
other choices can facilitate certain calculations. Parentheses (square brackets) indicate an 
open (closed) interval. 
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Table 1.3. Standard definitions for principal branch of elementary functions. 


Function Branch cuts 
Abs none 

Arg (ox, 0) 
Sqrt (oo, 0) 

z*, nonintegral s with Re[s] >0 (oo, 0) 

z, nonintegral s with Re[s] <0 (oo, 0] 
Exp none 

Log (ox, 0] 
trigonometric functions none 


ArcSin, ArcCos 


(=oo, —1) and (1, co) 


ArcTan (=100, —i] and [i, ico) 
ArcCsc and ArcSec (-1,1) 

ArcCot [—z, i] 

hyperbolic functions none 

ArcSinh (—ioo, —i) and (i, ico) 
ArcCosh (cv, 1) 

ArcTanh (Zoo, —i] and [i, ico) 
ArcCsch (-i, i) 

ArcSech (oo, 0] and (1, co) 
ArcCoth [-1, 1] 


1.5 Sets, Curves, Regions and Domains 


The basic concept used to characterize sets, curves, and regions in the complex plane 
is neighborhood. A neighborhood of zo consists of the set of all points that satisfy the 
inequality |z — Zol < e; the radius e is usually assumed to be small. A point z is an interior 
point of the set S if there exists a neighborhood containing only points belonging to S. 
Conversely, a point is exterior to S if there exists a neighborhood that does not contain any 
points belonging to S. Finally, a boundary point is neither interior nor exterior to S because 
any neighborhood, no matter how small, contains both points which belong to S and points 
which do not. An open set is a set for which every point is an interior point; in other words, 
an open set contains none of its boundary points. A closed set, on the other hand, contains 
all of its boundary points. The closure of S consists of S plus all of its boundary points and 
is denoted S. Note that some sets, such as O < |z| < 1, are neither open nor closed because 
they contain some but not all of their boundary points, while C is both open and closed 
because there are no boundary points. A set is bounded if all points lie within a disk |z| < R 
for some finite R and is unbounded otherwise. Finally, a point z, is an accumulation point 
of S if every neighborhood contains at least one other point that also belongs to S. Thus, a 
closed set contains all of its accumulation points and, conversely, any set which contains 
all of its accumulation points is closed. For example, the origin is the only accumulation 
point of the set {z,, = L, n= 1, oo}. 

Any set of points that consists only of boundary points constitutes a curve. For exam- 
ple, the set of points that satisfy the equation |z — zol == R describes a circle of radius R 
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=LI 


Figure 1.14. Left: A simply connected domain; right: a mutiply connected domain. 


centered on (xp, Yo) and is an example of a curve. An arc is a curve described by the para- 
metric equation z = (x[t], y[t]) where x[t] and y[f] are continuous real functions of the real 
variable fain < É < fmax: An arc is simple if it does not intersect itself, in other words if 
ti # ty = zit] + zt] for tnin < t,t < fmax: A simple closed curve does not intersect 
itself except at the endpoints where z[fnin] = Zltnax]- 

An open set is connected if any pair of points can be joined by a polygonal path that 
lies entirely within the set. An open connected set is called a domain. For example, the 
annulus 1 < |z| < 2 is a domain because it is open and connected. Any neighborhood is 
also a domain. A domain D is described as simply connected if all simple closed curves 
within D enclose only points that are also within D and is described as multiply connected 
otherwise. A domain together with a subset of its boundary points (none, some, or all) is 
called a region. For example, {z 9 Re[z?] > 1 A Re[z] > 0} describes a simply connected 
domain while {z > 1 < |z| < 2} describes a multiply connected region. 


1.6 Limits and Continuity 


The limit of f[z] as z > zo is defined to be the complex number wọ if for each arbitrarily 
small positive number e there exists a positive number 6 for which 0 < |f[z] — wol < € 
whenever 0 < |z — zol < ô. Geometrically, this definition requires that the image w = f[z] 
for any point z in a ó-neighborhood of zy, with the possible exception of zy itself, should 
lie within an e-neighborhood of wọ. Note that this definition requires all points in the 
neighborhood of z, to be mapped within the neighborhood of w, but does not require the 
mapping to constitute a domain because the mapping need not produce a connected set. 
Furthermore, the limit z > z ) may be approached in an arbitrary manner. However, the 
present definition does not apply to points z) which lie on the boundary of the domain on 
which f[z] is defined because in that case the ó-neighborhood contains points at which 
fiz] may be undefined. Nevertheless, we can extend the definition of limit by limiting the 
requirements on the inequalities to those points in the neighborhood of z, that lie within 
the domain of f. 

Direct application of the definition of limits can be quite cumbersome, but a few almost 
self-evident theorems are quite helpful. 
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Theorem 1. Let f[z] = u[x, y] + iv[x, yl, Z = Xo + EY, and wg = Ug + ivg. Then 


Lim flz] = wo (1.93) 
232 
ifand only if 
Lim ulx, y] =u, and Lim vix y] = vo (1.94) 
(xy)>(X0,Y9) (5,3) (%0,Yo 


Theorem 2. Let fy = Lim,_,., fiz] and gy = Lim,,,. glz]. Then 
1. Lim, Lz] + glz]) = to + 80 


2. Lim,,,, fzlglzl = fogo 


3. Lim, E = 2 if g #0 


252% giz] 
4. Lim... Abs[f[z]] = Abs[ fo] 


A function f[z] is continuous at zo if lims flz] = flzọ] and is continuous in a 
region R if it is continuous at all points within that region. Note that this definition implic- 
itly requires f[z] and its limit at z, to exist. 


Theorem 3. Let f[z] be defined in a neighborhood of z) and suppose that for all points in 
that neighborhood f[z] lies within the domain of g[z]. Then if f(z] is continuous at z) and 
giz] is continuous at f[Z], it follows that g| f[z]] is continuous at Zp. 


Consider a sequence of complex numbers (z,,). The limit of a sequence z, > w requires 
that |z, —w| < e whenever n > N[e]. A Cauchy sequence requires |z, —z,,| > 0 as n,m > ov, 
A sequence converges if and only if it is a Cauchy sequence. 


1.7 Differentiability 
1.7.1 Cauchy—Riemann Equations 


Let w = f[z] = ulx, y] + iv[x, y] be a function of the complex variable z = x + iy and define 
its derivative by 
d + Az] - 
W _ tim HR +42] - feel 


m.t“£s.mt- 


1. 
dz Az>0 Az Az>0 Az ( 95) 


Although this definition is simply the obvious generalization of the derivative of a real- 
valued function of a real variable, the higher dimensionality of complex variables imposes 
nontrivial requirements upon differentiable complex functions. The existence of such a 
derivative requires 


1. f[z] be defined at z 
2. fiz] # oo 


3. the limit must be independent of the direction in which Az > 0. 
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The independence of direction is a strong condition which leads to the Cauchy—Riemann 
equations, henceforth denoted CR. Approaching the limit using variations along coordi- 
nate directions, one finds 


hoe eas. iin u[x + Ax, y] + iv[x + Ax, y] — ulx, y] — ¿vLx, y] 


Ax>0,Ay=0 Ax ( 1 96) 
= u + > 
Ox Ox 
ten a u[x, y + Ay] + iv[x, y ad Ay] — u[x, y] — iv[x, y] 
Ax=0,Ay>0 ¿Ay 1.97) 
Ou a Ov a. 
E A 
dy ðy 
Equating the real and imaginary parts separately, then requires 
ô ô 0 ô 
“Z A-T (1.98) 


The CR equations are necessary but not quite sufficient to ensure differentiability. To obtain 
sufficient conditions, we also require continuity of the partial derivatives of component 
functions. 


Theorem 4. Let f[z] = u[x, y] + iv[x, y] be defined throughout a neighborhood |z — zol < e 
and suppose that the first partial derivatives of u and v wrt x and y exist in that neigh- 
borhood and are continuous at zù = (Xo, Yo). Then f'[z] exists if those partial derivatives 
satisfy the Cauchy—Riemann equations 


= =- 1.99 
Ox oy Oy Ox vee 


Conversely, if f’[z] exists, then the CR equations are satisfied. 


If f[z] is differentiable at zọ and throughout a neighborhood of z,, then f[z] is described 
as analytic (or regular or holomorphic) at zọ. If f[z] is analytic everywhere in the finite 
complex plane, it is described as entire. Examples of entire functions include Exp, Sin, 
Cos, Sinh, and Cosh. Functions which are analytic except on branch cuts include Log, 
ArcSin, ArcCos, ArcSinh, and ArcCosh. 

Recognizing that the CR equations are linear, it is trivial to demonstrate that if f, [z] and 
f,[z] are analytic functions in domains D, and D,, then any linear combination a/f,[z] + 
bf,[z] is also analytic in the overlapping domain D = D, N D,. Similarly, it is straight- 
forward, though tedious, to demonstrate that the product f,[z]f,[z] also satisfies the CR 
equations and, hence, is analytic in D. Furthermore, one can show that 1/ f,[z] is analytic 
in D, where f,[z] + 0 such that f,[z]/f,[z] is analytic in D except possibly at the zeros of 
the denominator. Finally, if f, [w] is analytic at w = f,[z], then f,[f,[z]] is analytic. Formal 
demonstration that these familiar properties of derivatives also apply to analytic functions 
of a complex variable is left to the student. 
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Example 


ð ô ô ð 
fese2 u=- v= => A, ES e (1.100) 
Ox Oy oy Ox 
The partial derivatives are continuous throughout the complex plane and satisfy the CR 
equation; hence, z? is entire. In fact, one can show that any polynomial in z is entire. 


Example 


fld==u=x ve-y> S142 =-1 (1.101) 


The partial derivatives are continuous, but do not satisfy CR; hence, z* is nowhere differen- 
tiable and is not analytic anywhere. It is important to recognize that functions of a complex 
variable can be smooth and continuous without being differentiable. The requirements for 
differentiability are stricter for complex variables than for real variables because indepen- 
dence from direction imposes correlations between the dependencies upon the real and 
imaginary parts of the independent variable. Analytic functions of one complex variable 
are not simply functions of two real variables. 


1.7.2 Differentiation Rules 


Many of the familiar differentiation rules for real functions can be applied to complex 
functions. Suppose that f[z] and g[z] are differentiable within overlapping regions. Within 
the intersection of those regions, we can derive differentiation rules using the definition in 
terms of limits. Alternatively, by separating each function into real and imaginary compo- 
nents, one could also employ the CR relations. 

For example, one quickly finds that the derivative of a sum 


Flz] = fiz] + glz] 


> = _ _ (1.102) 
— lim Fiz + Az] - Fiz] _ lim fiz + Az] — fiz] tim glz + Az] — elz] 
Az>0 Az Az>0 Az Az>0 Az 


reduces to the sum of derivatives 
Fiz] = fiz] + gk] = F'le] = f'k] + g' [2] (1.103) 


if both functions are differentiable. Similarly, the familiar rule for a differentiation of a 
product 


Fle + Az] ~ Fiz] _ įm fle + Asle + Az] — flelstz] 
Az J Az>0 Az 


Flzl = fklelzl = im 
(1.104 
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is obtained using f[z+Az] ~ fiz] + f’[z]Az and g[z+Az] ~ elz] + 9’[z]Az for differentiable 
functions and retaining only first-order terms, 


lim fiz + Azlglz + Az] - flzlglz] _ im Fale IzJAz + f'izlelz]Az 


Az>0 Az im Az ad ` 1a 
such that 
Fiz] = flzlslz] = F'[z] = flzlg'[z] + f'[zlelz] (1.106) 


By similar reasoning one can verify all standard differentiation rules, subject to obvious 
conditions on differentiability of the various parts. Perhaps the most important is the chain 
rule 


Fiz = gL£1211 = F'i = (8 wf E), yg (1.107) 


provided that f is differentiable at z and that g is differentiable at w = f[z]. 


1.8 Properties of Analytic Functions 


Suppose that f[z] = ulx, y] + iv[x, y] is analytic in domain D and suppose that the sec- 
ond partial derivatives of the component functions u and v are continuous in D also. (We 
will soon prove that analytic functions are infinitely differentiable so that the component 
functions u and v must have continuous partial derivatives of all orders within D.) Differ- 
entiation of the CR equations then gives 

ðu Ov Pu ov Ov Fu u Cu 

=D a Gy a ey ee) 
Ov Ou ev Pu Pu Ov Pv Py 7 


ax dy dx ay dydx dp ae dp 


0 (1.109) 


Therefore, both the real and imaginary components of f are harmonic functions that satisfy 
Laplace’s equation. Furthermore, comparing the two-dimensional gradients 


Vu =&— +9— =8— -$ =AxW (1.110) 


Vv =8— +9— = -8— +9— = -Âx Vu (1.111) 


Vu-Vv=0 (1.112) 


we find that lines of constant u (level curves) are orthogonal to lines of constant v anywhere 
that f’[z] + 0. (Here fi represents the outward normal to the xy-plane.) If u represents a 
potential function, then v represents the corresponding stream function (lines of force), or 
vice versa. 

Consider, for example, f[z] = z? with u = x? — y? and v = 2xy. If we interpret v as an 
electrostatic potential, then u represents lines of force. Figure 1.15 shows equipotentials 
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Figure 1.15. Level curves for f[z] = z = u + iv are shown as solid for v and dashed for u. If the 
solid lines are interpreted as equipotentials, the dashed lines with directions given by —Vy represent 
lines of force. 


as solid lines, positive in the first quadrant and alternating sign by quadrant, and lines of 
force as dashed lines. The arrows indicate the direction of the force, as prescribed by —Vv. 
If electrodes were shaped with surfaces parallel to equipotentials, the interior field would 
act as an electrostatic quadrupole lens, focussing a beam of positively-charged particles 
along the 45° and 225° directions and defocussing along the 135° and 315° directions. 
Alternatively, if v represents a magnetostatic potential, then u would represent magnetic 
field lines. A beam of positively-charged particles moving into the page would be verti- 
cally focussed and horizontally defocussed by a magnetic quadrupole lens whose iron pole 
pieces have surfaces shaped by v « xy. 

It is also easy to demonstrate that, although harmonic functions may have saddle 
points, they cannot have extrema in the finite plane. Hence, neither component of an ana- 
lytic function may have an extremum within the domain of analyticity. Figure 1.16 illus- 
trates the typical saddle shape for components of an analytic function. Furthermore, the 
average value of a harmonic function on a circle is equal to the value of that function of 
the center of the circle. Proofs of these hopefully familiar properties of Laplace’s equation 
are left to the exercises. 

Suppose that Z, is a curve in the z-plane represented by the parametric equations z; [t] = 
{x,[¢], y, [¢]} and that f[z] is analytic in a domain containing Z,, such that the image W, of 
that curve in the w-plane is represented by w,[f] = f[z,[t]]. The slopes of tangent lines at 
a point zo and its image wọ are related by the chain rule, such that 


wile] = Faz le] = arg[w;[1]] = arg[z,[+]] + are[ £ [zo] (1.113) 


28 1 Analytic Functions 


Figure 1.16. Typical saddle: u = x? — y?. 


Thus, the mapping f[z] rotates the tangent line through an angle arg[f’[z,]]. The tangent 
to a second curve which passes through the same point z, is rotated by the same amount, 


wild = fall = arg[w>[1]] = arg|z5[4]] + arg[ fol] (1.114) 
such that angle between the two curves 
arg[w)] — arg[w;] = arg[z,] — arg[z;] (1.115) 


is unchanged by the conformal transformation specified by an analytic function f[z]. Sim- 
ilarly, distances in the immediate vicinity of zy are scaled by the factor |f”[z¿]1, such that 


lw — wol = If’ [zglllz — zol (1.116) 


Therefore, the image of a small triangle in the z-plane is a similar triangle in the w-plane 
that is generally rotated and scaled in size. 


1.9 Cauchy—Goursat Theorem 
1.9.1 Simply Comnected Regions 


We have seen that the components of analytic functions are harmonic and might be stim- 
ulated to pursue analogies with potential theory as far as possible. Remembering that the 
line integral about a closed path vanishes for a potential derived from a conservative force, 
we seek to evaluate 


G flaldz = fiuds-vdy) +iqpudy +vdo (1.117) 
c C c 


for an analytic function f = u + iv of z = x + iy where u[x, y] and v[x, y] are real. If 
we require P[x, y] and Q[x, y] to be differentiable within the simply connected region R 
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enclosed by the simple closed contour C, we can apply Stoke’s theorem to prove 


2 er 
firar+oay= | (5 yid (1.118) 
Let 
=200=s vip. 2-2 (1.119) 
Ox Ody 


where fi is normal to the xy-plane and use di = (dx, d y, 0) as the line element and da = 
ndx dy as the area element to obtain 


pañ-9= [45 9x0= ferdr+ody= | (2 -Zjara (1.120) 
C R C R\ Ox oy 


Applying this result, known as Green’s theorem, to the real and imaginary parts of the line 
integral separately, and using the CR conditions for analytic functions, we find 


ftudx - vdy) = {- ee - Saray = 0 (1.121) 
Ox oy 


Quay + vas) = E - op) trd = 0 (1.122) 


and conclude that 
f analytic for z within C => VHG dz=0 (1:123) 
C 


This result was first obtained by Cauchy, but was later generalized by Goursat. The deriva- 
tion above requires not only that f'[z] exist throughout R, but also that it be continuous 
therein. The latter restriction can be removed. 


Theorem 5. Cauchy—Goursat theorem: If a function f[z] is analytic at all points on and 
within a simple closed contour C, then fo fiz]dz = 0. 


1.9.2 Proof 


Consider the closed contour C sketched in Fig. 1.17. Divide the enclosed region R into a 
grid of squares and partial squares, whereby 


dz = d 1.124 
dra z Lp ra z (1.124) 


where the contributions made by shared interior boundaries cancel such that the net con- 
tour integral is the sum of the exterior borders of outer partial squares. For each of these 
cells, we construct the function 


[z] 
zq = Ee - fle (1.125) 


Zj 
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x=Re[z] 


Figure 1.17. Proof of the Cauchy—Goursat theorem. Contours about four of the interior squares are 
labeled C,_,. If f[z] is continuous, contributions to the contour integral from shared sides cancel, 
leaving only the outer border C that passes through partial squares. In the partial square, labeled Cs, 
we identify two distinct points labeled z and z;. 


where z and z; are distinct points within or on C; and evaluate its largest modulus 


fkl- fiz] 
CE 


Zj 


ô; = ve =f iki 


| (1.126) 


For any positive value of e, a finite number of subdivisions is sufficient to ensure that all 
ô ¡<e because f[z] is differentiable. Thus, we can now write 


fal = fll + (Fi; + ôl 2,1) 2, (1.127) 
for any z C C;, such that 
$ fiz] dz = senp dz + f'[z;] $ (2-z,)dz+ $ ólz, 2,2 — z;) dz (1.128) 
G; "de, E: G 


The first two terms obviously vanish, leaving 


pad: 0 ólz, 2,1% - z;) dz (1.129) 


which can be bounded by 


[z] dz] < 
bad = Y, 


(1.130) 


$ ólz 2,1 - z;) dz 


C; 
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Figure 1.18. Wedge contour used for J Cos[x?] dx. 


Ifs f is the length of the longest side of partial square C o then |z- z jl < y 2s p Furthermore, 
lô jl < e, such that 


lz, z; (z -z;)d 
$ [z zz =2;) dz 


where L, is the length of that part of C, that coincides with C. Because each factor is 


< V2s,6(4s, + L) (1.131) 


bounded and e > 0 may be taken arbitrarily small, we find that | be fiz] dzl is also arbitrar- 
ily small and, hence, must vanish. Therefore, the Cauchy—Goursat theorem is established 
without assuming that f’ is continuous. 


1.9.3 Example 


Contour integration of analytic functions provides powerful new methods for evaluation 
of otherwise intractable definite integrals. Although we will consider a wider variety later, 
for now consider the integral 


i > Cos[x?] dx (1.132) 
0 


which arises in the Fresnel theory of diffraction. It appears to be difficult to evaluate this 
integral using standard methods for real variables; nor is it obvious that this integral even 
converges. On the other hand, the Cauchy—Goursat theorem ensures that 


I= Q expli dz=0=1,+h+1, (1.133) 
C 


for a contour C consisting of a wedge of opening angle @ = 7 closed by a circular arc at 
R > oo; this contour is shown in Fig. 1.18. Consider first the circular arc where 


z= Re”? = e = Exp|éR? Cos[26]| Exp| -R° Sin[26]| (1.134) 
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Recognizing that O < Sin[26] < 1 is positive on the arc, the integrand is damped by a 
factor of order e~®’ such that 


R > œ => L =0 => h =-} (1.135) 
where 
L= f Cos[x?] dx + if Sin[?] dx (1.136) 
0 0 
The return line is represented by 
1+i 1l+i 2 2 
Z=—=t>dz= dte” =e" (1.137) 
y2 y 
such that 
1+i f p L+ivr 
L = — e" dt=- — (1.138) 
* Vada v2 2 


Therefore, equating real and imaginary parts, we find 


i Cos[x?] dx = i Sin[?] dx = (1.139) 
0 0 8 


rather easily. By representing the integrand in terms of analytic functions and choosing a 
clever contour, one can perform a surprisingly diverse variety of integrals relatively pain- 
lessly. In this case we even obtain two results for the price of one. (What a deal!) 


1.10 Cauchy Integral Formula 
1.10.1 Integration Around Nonanalytic Regions 


Suppose that the region R = R, + R, enclosed by the simple closed contour C includes a 
localized region R, where the function f is nonanalytic, but that f is analytic everywhere 
else within C. The Cauchy—Goursat theorem can be applied to such a region by deforming 
the contour in a manner that encapsulates the problematic region. Figure 1.19 illustrates 
this technique. The colored region represents the nonanalytic region R, and the outer circle, 
when closed, represents the contour C and is traversed in a positive, counterclockwise, 
sense. Note that C need not actually be circular, but it is easier to draw that way. We 
imagine drawing line A from C to a point just outside the nonanalytic region. The contour 
C, goes around this region in a negative, clockwise sense, remaining within the analytic 
region R}, ending close to its starting point. We then return along B to the contour C}. 
The common path AB traversed in opposite directions between inner and outer contours 
is sometimes called a contour wall and serves to create a simply connected region R, for 
which the Cauchy—Goursat theorem requires 


[sides f ptelde+ | faz [ ftelde=0 (1.140) 
C, A B G 
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Figure 1.19. Construction of a contour wall and demonstration that a contour within an analytic 
region may be shrink-wrapped around and enclosed nonanalytic region. 


Recognizing that, for a continuous integrand, the contributions of A and B must become 
equal and opposite as the separation between those paths becomes infinitesimal, we find 


| flaldz+ f fiad =0= op fid = - Flzl dz (1.141) 
A B G E 


Here the negative sign occurs because the inner contour is traversed in the opposite direc- 
tion when reached by means of the contour wall. Therefore, the original contour can be 
shrink-wrapped about the nonanalytic region without changing the value of the contour 
integral. 

Tf the path C encloses several localized nonanalytic regions, we simply construct sev- 
eral contour walls. The net contour integral is then just the sum of the contributions 
from shrink-wrapped contours around each nonanalytic region. Take care with the signs 
though — if the original contour is traversed in a positive sense, the nonanalytic regions 
are enclosed in a negative sense by the continuous deformed contour that circumvents 
nonanalytic regions. However, recognizing that the entire contour integral vanishes and 
that the contour walls cancel, the net integral for a simple contour that encloses nonan- 
alytic regions reduces to the sum of the contributions made by shrink-wrapped contours 
enclosing the nonanalytic regions in a positive sense. Therefore, if there are N isolated 
nonanalytic regions within the simple closed contour C, we find 


N 
dz = d 1.142 
$ fiz]dz 2 > s z (1.142) 


where each simple closed contour C, encloses one of the nonanalytic regions and is tra- 
versed with the same sense as the original contour C. 

We postpone consideration of extended nonanalytic regions to the next chapter, but in 
the next few sections consider the important special case of an isolated singularity within 
the contour. 
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1.10.2 Cauchy Integral Formula 


Suppose that the contour C lies within a region R in which f[z] is analytic, but that it 
surrounds another region R’ in which f is not analytic. We demonstrated above that the 
contour can be deformed, such that C > C’ where C’ is immediately outside R’, without 
changing the value of the contour integral 


VHG dz = $ fiz] dz (1.143) 
C (04 


Thus, a contour integral that encloses a single localized nonanalytic region can be shrink- 
wrapped about the border of that region. This result is particularly useful for the case of 
an isolated singularity for which the region of nonanalyticity consists of a single point Zp. 
Consider the integral 


afl 


c s-z 


(1.144) 


where f is analytic throughout the region enclosed by C while the integrand it singular 
at z. If z is outside C the integral vanishes because the integrand is analytic at all points 
within C. Alternatively, if z lies within C, we can reduce C to a small circle surrounding z, 
such that 


s-z = re? = ds = ire? de (1.145) 


Thus, the integral can be approximated 


ppi 
pas = fiz] $ me at = rifle] (1.146) 
c $2 E 


re 


to arbitrary accuracy as r > 0. Therefore, we obtain the Cauchy integral formula: 


Theorem 6. Cauchy integral formula: If a function f[z] is analytic at all points on and 


1 fis] . . . 
ant Pe e ds for any interior point z. 


This remarkably powerful theorem requires that the value of an analytic function at any 
interior point is uniquely determined by its values on any surrounding closed curve and is 


within a simple closed contour C, then f[z] = 


analogous to the two-dimensional form of Gauss” theorem. The behavior of an analytic 
function is severely constrained. 


1.10.3 Example: Yukawa Field 


Using elementary field theory, the virtual pion field surrounding a nucleon is represented 
in momentum space by 


N2 
ólal = — 1.147 
dl] Pane ( ) 
The spatial distribution is then obtained from the three-dimensional Fourier transform 
Pe nox 4n A? (* qSin[gr] 
E e Tr — d 1 . 148 
a= f Chea a a aq (1.148) 
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Figure 1.20. Great semicircle with enclosed pole at z = iA. 


where spherical symmetry and the multipole expansion of the plane wave have been used 
to reduce the integral to one dimension. (Alternatively, the angular integrals can be evalu- 
ated directly.) Recognizing that the integrand is even, we can write 


œ qSin{qr] 1 T qSin[gr] , 1 T q Expligr] 
dq = dq = d 1.149 
Í PP IN UT Tie pina T e! 
because the contribution from Cos[qr] vanishes by symmetry. Now consider the contour 
integral 


2 giz 
Hal = q Ez de (1.150) 
where 
glz] = Gam (1.151) 


is analytic in the upper half-plane. 

If we choose a contour C, shown in Fig. 1.20, consisting of the real axis and a semicircle 
in the upper half-plane with R => oo, affectionately called a great semicircle, this integral 
can be expressed as 


© gExpliqr] = f Exp|irRe”] 
I[A] = 2 gir == 4d 1.152 
[A] dl PAN q +2 o REO ( ) 
Using 
Exp|irRe“| = ExplirR Cos[0]] Exp[-rR Sin[8]] (1.153) 


and recognizing that Sin[@] > O in the upper half-plane, we realize that the contribution of 
the circular arc decreases exponentially with R and vanishes in the limit R > oo. Therefore, 
with the aid of the Cauchy integral formula 


I[A] = 2nigliA] = ine” (1.154) 
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we obtain the Yukawa field 


N2 eN 
oir = E 


(1.155) 


r 


that is central to the meson exchange model of the nucleon—nucleon interaction that binds 
atomic nuclei together. 


1.10.4 Derivatives of Analytic Functions 


Let 
He paw fiw] (1.156) 
2ni Je 


W-Z 


represent a function that is analytic in the domain D containing the simple closed con- 
tour C. First we demonstrate that differentiation can be performed under the integral sign. 
Using 


flz + Az] - fk] = if an fw fl) 
C w=z-—Az W-=Z 


E Z faw flw] 
2ri Jc  (w-z)(w -z - Az) 


we recognize that the left-hand side vanishes in the limit Az > O because f[z] is continuous 
and must demonstrate that the right-hand side shares this property. Recognizing that the 
integrand is analytic everywhere within C except at z, we may reduce the contour to a 
small circle of radius r around z. Let M = max[|f[w]l] on the reduced contour, and use the 
triangle inequalities to evaluate the maximum modulus of the integrand, such that 


flw] $ M 
A d < |A d 
<Q "o —a(w—z—Ag) Aal C lav — aw —z - Az) 
MlAz| 


"rr — Az) 


(1.157) 


(1.158) 
< 20 


vanishes in the limit Az > 0 for finite r. Thus, we find that 


cs Get Ae) Ale. 1 $ fiw] 
m = dw J 
c  (w-2) 


1 
aD Az 2nt 


1 d 
‘l= Od —(w-z)! (1 
=> fkl -= za d wfwlgz o 0159 
is also analytic within D. Repeating this process, we obtain 
a" fiz] == ha fiw] 
C (w = gal 


dz” ni 
by induction. Therefore, we have demonstrated by construction the remarkably powerful 
theorem that analytic functions have derivatives of all orders. This also requires all partial 
derivatives of its component functions to be continuous in D. This theorem will soon be 
used to derive series representations of analytic functions. 


Fl = (1.160) 
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Theorem 7. /fa function f is analytic at a point, then derivatives of all orders exist and 
are analytic at that point. 


1.10.5 Morera’s Theorem 
The converse of Cauchy—Goursat theorem is known as Morera’s theorem: 


Theorem 8. Morera’s theorem: If a function f[z] is continuous in a simply connected 
region R and be flz]dz = 0 for every simple closed contour C within R, then flz] is 
analytic throughout R. 


If every closed path integral vanishes, the path integral between two points in the 
domain of analyticity D depends only upon the end points and is independent of the path, 
provided that the path lies entirely within D. Hence, we define the function F[z] by means 
of the definite integral 


Flz,] - Flz,] = i flz] dz (1.161) 
Clearly, 
[Orta tea) dz = Fie- Fe- @ - afte (1.162) 


21 


such that the limit as z, > 2, 


E z fizi) dz : F[z,] = Fiz] 
m = lim 


2271 fy 7271 2271 fg 771 


- fiz] = F'[z,1- fly] (1.163) 


compares F’[z,] with f[z,]. However, the integral vanishes in the limit of vanishing range 
of integration because f[z] is continuous in D, such that 


SEFE- fle) dz 
m 


22571 Za 7271 


=0 = Fla] = fk] (1.164) 


Thus, F[z] is analytic in D with F’[z] = fiz]. Therefore, because the derivative of an 
analytic function is also analytic, we conclude that f[z] must also be analytic, proving 
Morera’s theorem. 

Morera’s theorem is sometimes useful for proving general properties for analytic func- 
tions of various types, but is rarely of practical value to more detailed calculations. 


1.11 Complex Sequences and Series 
1.11.1 Convergence Tests 


An infinite sequence of complex numbers {z,, = 1, 2,...} can be represented by com- 
bining two sequences of real numbers {x„ n = 1,2,...} and {y „n = 1,2,...} such that 


ne 
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Zz, = x, + iy, The sequence z, converges to z if for any small positive e there exists an inte- 
ger N such that |z,, —z| < € for n > N. Convergence of a complex series z, to z = x + iy then 
requires convergence of both x, to x and y, to y. Many of the properties of real sequences 
can be adapted to complex sequences with only minor and obvious changes. Therefore, 
we state without proof the Cauchy convergence principle: 


Theorem 9. The sequence {z,} converges if and only if for every small positive € there 


exists an integer N, such that |z,, — Z,| < € for any m,n > N.. 


If {z,,} and (w,) are two convergent sequences with limits z and w, then (az, + bw, ) and 
[z, w,) are also convergent sequences with limits az + bw and zw. 
An infinite series of complex numbers z, converges if the sequence of partial sums 


S, = E Zk (1.165) 
k=1 
converges to S, such that 


lim S, = S = S = 


n-oo 


(1.166) 


Me 
SS 


k=1 


If the sequence of partial sums does not converge, the corresponding series diverges. 
A series is absolutely convergent if the series of moduli 


> lal (1.167) 
k=1 


converges. An absolutely convergent series converges, but a convergent series need not 
converge absolutely. A convergent series that is not absolutely convergent is described as 
conditionally convergent. For example, the alternating harmonic series Y, (—)*/k con- 
verges conditionally but not absolutely because »¿2, k7* diverges. Term-by-term addition 
of convergent series yields another convergent series, but convergence of a series formed 
by termwise multiplication requires absolute convergence of the individual series. 

If {z,} does not converge to zero, the corresponding series diverges because the sequence 
of partial sums will not satisfy the Cauchy convergence condition. However, convergence 
of the sequence of terms to zero does not ensure convergence of the series. The most gen- 
eral analysis of a series separates its terms into real and imaginary parts and then applies 
one of the many tests developed for series of real numbers to the real and imaginary sub- 
series separately; the complex series then converges if both its real and imaginary subseries 
converge. However, it is usually simpler and often sufficient to test for absolute conver- 
gence instead. The following convergence tests familiar for real series can be generalized 
to complex series. 


Comparison test: If 0 < |z,| < a, for sufficiently large k and >, a, converges, then >, 2; 
converges absolutely. 
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Ratio test: If |z,,,/z,| < r for all k > N, then >, z, converges absolutely if r < 1. Alter- 
natively, if r = lim,_,,, 1z,4;/7Z,| the series converges absolutely if r < 1 but diverges if 
r > 1. This test is inconclusive if r = 1. 


Root test: If |z,|!“* < r for all k > N, then >, z, converges absolutely if r < 1. Alterna- 
tively, if r = lim, „„ Izl! the series converges absolutely if r < 1 but diverges if r > 1. 
This test is also inconclusive if r = 1. 


Integral test: Suppose that f[k] = |z,| where f[x] is defined for x = n = 1. The series then 
converges absolutely if the integral Í ™ Fix] dx converges. 


Note that the ratio test is indeterminate when lim} æ |z,,,/z,| = 1. For example, the har- 
monic series z, = k™! diverges while the alternating harmonic series converges. A “sharp- 
ened” version, established in the exercises, shows that a series converges absolutely if the 


ratio of successive terms takes the form 
an+ 


4; 


Ss 
=1-- (1.168) 


n 


for large n with s > 1. 
Often the terms of a series will themselves be functions of a complex variable, z, such 
that 


fa => ala (1.169) 
k=1 


represents a sequence (f,[z]) of partial sums. If such a sequence converges for all z in a 
region R, such that 


¿ER = fiz] = lim f,[z] = lim $) giz] (1.170) 
k=1 


then f,[z] is described as a series representation of the function f[z] valid within the con- 
vergence region R. Often the convergence region takes the form of a disk, |z — zol < R, with 
center zo and radius of convergence R. If a series converges for all z within |z — z)| < R 
but diverges for some points on the circle |z — zol = R, one still reports a radius of conver- 
gence R. The problem then is to determine the radius of convergence. 


Example 


What is the radius of convergence for a geometric series, y z*, extended to the complex 
plane? According to the ratio test, 


k+l 


zk 


Zk+1 
Zk 


= lal (1.171) 


this series converges absolutely for any |z| < 1 and diverges for |z| > 1. Thus, the radius of 
convergence is 1. Notice that even though the ratio test is inconclusive for |z| = 1, this series 
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clearly diverges on the unit circle because the terms do not approach zero. Alternatively, 
by the ratio test 


lim Izl! = lel (1.172) 


one finds convergence for |z| > 1 and divergence for |z| = 1. Furthermore, one can demon- 
strate that 


n 1 
ki < 1 = lim Jz E (1.173) 
k=0 
within the radius of convergence. Let 
1- 
fala == == (1.174) 


represent a sequence of complex numbers, where the last step is verified by direct multi- 
plication 


A-DI +z+ +... 2 =U 4722 4...42)-@t72+...42" 


ae (1.175) 


Then, separating the constant term (for fixed z) from the variable part of the sequence 


1 got 


fill] = 32271 (1.176) 


and recognizing 


ll <1= lim =0 (1.177) 
n>o0 —_ Z 
one finds that 
1 
ll < 1 = lim f,[z] = = (1.178) 
now — Z 


Therefore, the geometric series 


x 1 
ld<1= > w= ar (1.179) 
k=0 = 


converges to a simple analytic function within the unit circle, thereby extending a familiar 
result from the real axis to the complex plane. 


1.11.2 Uniform Convergence 


A sequence of functions (f,[z]) is said to converge uniformly to the function f[z] in a 


region R if there exists a fixed positive integer N, such that |f [z] — flz]l < e for any z 
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within R when n > N.. Consequently, a uniformly convergent series f [z] = Xz- g,Lz] 
provides an approximation to f[z] within R with controllable accuracy — there exists a 
finite number of terms, even if large, that guarantees a specified degree of accuracy any- 
where within the region of uniform convergence. The region of uniform convergence is 
always a subset of the region of convergence. For example, although the geometric series 
No 2 converges uniformly to (1 — z)™! within any disk |z| < R < 1 with less than unit 
radius and is convergent within |z| < 1, one cannot properly claim uniform convergence 
throughout the open region |z| < 1 because the convergence becomes so slow near the cir- 
cle of convergence that there will always be points within that region that require more than 
N terms to achieve the desired accuracy no matter how large N is chosen. Convergence at 
z without uniform convergence within the region of interest is described as pointwise. 

The most common test for uniform convergence is offered by the Weierstrass M-test: 
The series >, f,[z] is uniformly convergent in region R if there exists a series of positive 
constants M, such that |f,[z]l < M, for all z in R and >, M, converges. The proof follows 
directly from the comparison test. (For what it’s worth, M stands for majorant.) 

The follow theorems for manipulation of uniformly convergent series can be estab- 
lished by straightforward generalization of the corresponding results for real functions. 


Continuity theorem: auniformly convergent series of continuous functions is continuous. 


Combination theorem: the sum or product of two uniformly convergent series is uni- 
formly convergent within the overlap of their convergence regions. 


Integrability theorem: the integral of a uniformly convergent series of continuous func- 
tions is equal to the sum of the integrals of each term. 


Differentiability theorem: the derivative of a uniformly convergent series of continuous 
functions with continuous derivatives is uniformly convergent and is equal to the sum 
of the derivatives of each term. 


Furthermore, by combining these results one can obtain the more general Weierstrass the- 
orem establishing uniformly convergent series as analytic functions within their conver- 
gence regions. Thus, the property of uniform convergence is important because 1t makes 
available all theorems in the theory of analytic functions. 


Theorem 10. Weierstrass theorem: If the terms of a series >, 8,12] are analytic throughout 
a simply-connected region R and the series converges uniformly throughout R, then its sum 
is an analytic function within R and the series may be integrated or differentiated termwise 
any number of times. 


1.12 Derivatives and Taylor Series for Analytic Functions 
1.12.1 Taylor Series 


It is now a simple matter to demonstrate the existence of power-series expansions for 
analytic functions. Suppose that f is analytic within a disk |z — z,| < R centered upon zo 
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and assume that 


00 


fl = a, 2-2)" (1.180) 


n=0 


Consider the integral 


1 1 
L => dz 1.181 
k Pri $ (z - Z) ( ) 


evaluated on the circle |z — z,| = R. Using dz = iRe dé, we find that 


RIE 2r 


i= e0- do = Sh (1.182) 


27 0 
vanishes unless k = 1. Therefore, the coefficients of the power series can be evaluated 
according to 
1 (n) 

fa: EN 


A (1.183) 


a, 


Although we performed this calculation using circular contours, the same results would be 
obtained for arbitrary simple contours within the analytic region because the singularities 
in the integrands are confined to a single point, which can be excised. A power series 
centered upon the origin is sometimes called a Maclaurin series while a more general 
power series about arbitrary z, is called a Taylor series. 


Theorem 11. Taylor series: If a function f is analytic within a disk |z — zo| < R, then the 
(nf 

power series f[z] = X} -o 4,(Z — 2p)" with a, = f Tal converges to f(z] at all points within 

the disk. Conversely, if a power series converges for \z — zol < R, it represents an analytic 


function within that disk. 


It is instructive to demonstrate convergence of the power series directly. Expanding 


=i n k 
(s-37 =(s-2)" (: S ==) = (s-z)! [a + 2 (=>) | (1.184) 


where 


(z _ ro i 


=~ V 1.185 
(S — Zo)” (s — 2) ( ) 


n 


the Cauchy integral formula becomes 


n 


= 1 fis] = _ k = n+l 
Me $ ds: = 3 a(z — 2) + R,(Z— %) (1.186) 
where 
1 Fis] _ fP 
a, = mi P Om E (1.187) 
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as before and where the remainder takes the form 


aig. ntl 
1 fa Fs] (2-20) (1.188) 
C 


a” Oni NE = z) ce gay 
Identifying 
lz- z|=p, M=max{(lf[s]l], 6 = min[ls — zl] (1.189) 


and choosing a circular contour with 


Is-zl=r>p (1.190) 
we find 
n+l 
IRI <(£) ZM = limR,=0 (1.191) 
r ô n>00 


Thus, this power series converges throughout the region of analyticity that surrounds Zp. 
Therefore, the radius of convergence is the distance to the nearest singularity in the com- 
plex plane. With more careful analysis, one may find that a Taylor series converges at some 
points on the circle of convergence also. 

The Taylor series for f[z] about a point z) = (Xp, 0) on the real axis has the same form as 
the expansion of f[x] interpreted as a function of the real variable x. More importantly, this 
extension of the Taylor theorem to the complex plane often provides the simplest method 
for evaluating the radius of convergence of a power series. Consider the hyperbolic tangent 


2 17 2 1382 21844 
p=— z+ f Z- 38 gt $ z? +... (1.192) 
15% 315% ` 2835" 155925 6081075 


It is difficult to evaluate the general term and to deduce the radius of convergence from 
the real-valued series, but from the function of a complex variable we know immediately 
that the radius of convergence is 1/2 because the nearest roots of Cosh[z] are found at 
z = t17/2. 

Sometimes it is necessary to determine the radius of convergence directly from the 
terms of the power series. Then one finds 


1 
Tanh[z] = z- ae + 


-1 
== ) (1.193) 


using the ratio test, or 
R = lim la," (1.194) 
n-oo 


using the root test. For example, the Maclaurin series for Log[1 + z] takes the form 


Log[1 + z] = Not (1.195) 


n=1 


and one obtains a convergence radius R = 1 using the ratio test. In this case the conver- 
gence radius is limited by the branch point at z = —1. Notice that at z = 1 this power series 


44 1 Analytic Functions 


reduces to the alternating harmonic series 


29 (=)! 
y = Log[2] (1.196) 
n=1 

and thus converges for at least one point on the convergence circle, while at z = —1 the 


resulting harmonic series diverges. Applying the root test instead suggests a limit 


limn!” = 1 (1.197) 


n-oo 


that might not be obvious otherwise. 


1.12.2 Cauchy Inequality 
Let 


M[r] = max,- 


IF EII (1.198) 


Zol=r 


represent the maximum modulus of an analytic function on a circle of radius r surrounding 
Zo. We then find that 


1 M 
la,| < — $ dz— = Mr” = |a,|r" < M[r] (1.199) 
on perl 

constrains the coefficients of the Taylor series. 


Theorem 12. Cauchy inequality: If a function f[z] = Y y a, (2 — zp)” is analytic and 
bounded in D and |f[z]l < M on a circle |z — zol = r, then la, |r" < M. 


1.12.3 Liouville’s Theorem 


Theorem 13. Liouville’s theorem: If a function flz] is analytic and bounded everywhere 
in the complex plane, then f[z] is constant. 


According to the Cauchy inequality, if |f[z]| < M for |z| < R, then la, |R" < M. If this 
inequality applies in the limit R > oo, then we must require a, > 0 for n > 0. Therefore, 
if f is not constant, it must have a singularity somewhere. The behavior of functions of a 
complex variable is largely determined by the nature and locations of their singularities. 


1.12.4 Fundamental Theorem of Algebra 


Theorem 14. Fundamental theorem of algebra: Any polynomial P [z] = Yio 4,2" of 
order n = 1 must have at least one zero z > P [zo] = 0 in the finite complex plane. 


Although it is difficult to prove the fundamental theorem of algebra using purely alge- 
braic means, it is an almost trivial consequence of Liouville’s theorem. If P,[z] has no 
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zeros, then the function f[z] = 1/P,[z] would be analytic throughout the entire complex 
plane. Recognizing that 


zI = R > co = IP [z]l > la, IR" = fiz] > ae (1.200) 
n 

it is clear that f[z] is bounded. Thus, Liouville’s theorem requires f to be constant, but 
flz] vanishes in the limit R > oo, which contradicts the absence of zeros in P, . Therefore, 
,, must have at least one zero. Factoring out this root, we write P [z] = (z — z,)P,_,[z] 
and apply the theorem to P,_,, concluding that it must also have a root if n > 2. Repeating 
this process, we determine that a polynomial of degree n must have n roots, although some 

might be repeated. 


1.12.5 Zeros of Analytic Functions 


If a function f[z] is analytic at z, there exists a disk |z — z)| < R wherein the Taylor series 
converges, such that 
2 ml < R= fla = >) a,(c- 2%)" (1.201) 
n=0 
Suppose that z was chosen to be a root of f[z¿] == 0, such that a, = 0. If a, + 0, we 
describe zg as a simple zero of f, but if all a, = O with n < m while a,, + 0, we describe 
Zo as a zero of order m. It is then useful to express the Taylor series in the form 


fiz] = z- 29)" oiz] (1.202) 


where the auxiliary function 
gizl = Y amn- 20)" (1.203) 
n=0 
employing the coefficients a, with n = m has the nonzero value $[Z | = a,, at 2. Clearly 
$ is continuous at z, and is analytic within the radius of convergence. Therefore, for any 
small positive number e there exists a corresponding radius 6 such that 


lélz] =a,,| <£ whenever |z—z| < ô (1.204) 


m 


Suppose there were another point z, in a neighborhood of zọ where ¢[z,] = 0, such that 
iz] =0 = la,,| <.e whenever |z; -Zol <6 (1.205) 


can only be satisfied if a,, = 0, contrary to our assumption that z, is a zero of order m. 
Therefore, we conclude that if f is analytic and does not vanish identically, there must 
exist a neighborhood around any root in which no other root is found; in other words, the 
roots of analytic functions are isolated. 


Theorem 15. Suppose that a function f[z] is analytic at zy and that flz,] = 0. Then 


there must exist a neighborhood of z containing no other zeros of f unless f vanishes 
identically. 
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Figure 1.21. To develop the Laurent series, a small contour C within an analytic region is stretched 
toward the limits of the region of analyticity, indicated by C, and C,, with the aid of contour wall. 


1.13 Laurent Series 
1.13.1 Derivation 


A more general expansion which is useful in an analytic region that surrounds a nonana- 
lytic region is provided by the Laurent series. 


Theorem 16. Laurent series: If F[z] is analytic throughout the region R, < |z — zol < Ry; 
it can be represented by an expansion 


00 


fal = Y a,(2- 2)" (1.206) 


nN=—oo 


with coefficients 


_ af fleldz_ (1.207) 
E 


~ Oni (z- ae 


computed using any simple counterclockwise contour C within the analytic region. 


n 


If R; > 0 and coefficients with n < O vanish, then the Laurent series reduces to the 
Taylor series. 
Suppose that C is a small contour surrounding an interior point z, such that 


ftiz oat 
2mi Jo s-z 


(1.208) 


according to the Cauchy integral formula. As shown in Fig. 1.21, we can stretch C to 
the limits of the annulus without changing the integral because the integrand is analytic 
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throughout that region. Recognizing that the opposing segments of the contour wall cancel, 
we obtain 


aa 


: (1.209) 
s-z 2ni R oz 


fiz] = 


where R, and R, denote counterclockwise circles at the inner and outer borders of the 
analytic annulus. For the outer integral we employ the expansion 


-1 o0 n 
(s= 2)! =(s-2y" (: = =] =(s—z) | E =) (1.210) 


S — Zo A=0 S— Zo 


while for the inner integral 


-1 co 
_ yl - (7 -1 _ 57% = -1 S — Zo 
6-2) (Z-z) (: a) =- (z - zo) AE (1.211) 
such that 
2miflz] = De- as a + Ye-ar de '$ dsflslis= z)" (1.212) 


Both integrands are analytic throughout the annulus and are independent of z. Hence, these 
integrals can be evaluated using any simple closed path within the analytic region. There- 
fore, we may combine the two terms into a single expression 


00 


fl = Y, a,(z 2) (1.213) 
_ ol fiz] 
an = Oni faz _ g (1.214) 


representing the Laurent expansion. One can also show that the Laurent expansion about 
a specific zg is unique within its analytic annulus. 


1.13.2 Example 


The function 


1 
= ———_ 1215 
Meno (1.215) 


has singular points at z = 0, 1. Suppose that we evaluate the Laurent coefficients using 
contour integration 


ro? 27 ¿—i0n+2) 
ist) _ f EA (1.216) 
0 


a, = z 
n O st! 27 1 — Re” 
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on a circle with s = Re’ and ds = is d@. For R < 1 we can expand the integrand to obtain 


R2 29 


27 
R<l>a,= r f Exp[i(k — n — 2)0] d0 (1.217) 
0 


k=0 
Nonvanishing coefficients then require k = n + 2 and k = 0 = n > —2, such that 


co 


O<h les => 2 (1.218) 
n=0 


Alternatively, for R > 1 we use (1 — 2)! = "(1 — z"!)7! to obtain 


27 
R>1>4,=- F yet Exp[—i(k + n + 3)0] d0 (1.219) 
for which nonvanishing coefficients require k = —n — 3 and k = 0 => n < —3, such that 
I> l= fld=-> 2" (1.220) 


n=0 


Although the Laurent theorem provides an explicit formula for the coefficients, evalu- 
ation of the contour integrals is often difficult and one seeks simpler alternative methods. 
In this case we can use the partial fractions 


1 1 1 1 

= +-+ 1.221 

2d-D 2 z l-z ( ) 
and 
z| < 1 = : -Y2 (1.222) 
i 1- z 7 2a“ l 
1 1 = -n 

zi > 1 = = =- Z (1.223) 


l-z z -z') 


n=1 


to obtain the same results without integration. In other cases we may be able to convert a 
known Taylor series into a Laurent series. For example, 


Log[1 +z] =X oi for |z|<1 (1.224) 


n=1 
= Log [1+] = PAO tae for [zl > 1 (1.225) 
n=1 


where the latter is valid in the largest annulus that excludes the branch cut —1 < x < 1 on 
the real axis. 
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1.13.3 Classification of Singularities 


Suppose that f[z] is singular at z, but analytic at all other points in a neighborhood of zp; 
f is then said to have an isolated singularity at z). A function that is analytic throughout 
the finite complex plane except for isolated singularities is described as meromorphic. 
Meromorphic functions include entire functions, such as Exp, that have no singularities in 
the finite plane and rational functions that have a finite number of poles. Functions, such 
as Log, that require branch cuts are not meromorphic. 

The Laurent expansion about an isolated singularity takes the form 


00 


fld= >) aE- z)" (1.226) 


n=—oo 


If a,, + 0 for some m < 0 while all a <m 
and the coefficient a_, is called the residue of the pole. A simple pole has m = —1. If the 
function appears to have a singularity at zọ but all a,, vanish for m < O, z, is described as 
a removable singularity because the function can be made analytic simply by assigning a 


suitable value to f[zọ]. For example, z = 0 is a removable singularity of 


= 0, then z, is classified as a pole of order —m 


Sin] Wo E on 
z  44Qn+D! 


FR] = (1.227) 


because with the assignment f[0] = 1 the function is continuous and its Laurent series 
reduces to a simple Taylor series. 

If the Laurent expansion has nonvanishing coefficients for arbitrarily large negative 
n > —oo and the inner radius vanishes, then it has an essential singularity at z). According 
to Picard’s theorem, essential singularities have the nasty property that f[z] takes any, 
hence all, values in any arbitrarily small neighborhood infinitely often with possibly one 
exception. For example, 


ghey 2 (1.228) 


has an essential singularity at the origin. The equation w == e!” is satisfied by 


w= e =>7z= = (Log |w| + i Arg[w] + 2nxi)* (1.229) 


1 
Log[w] 
for any integer n. By choosing n sufficiently large, one can make |z| as small as desired. 
Thus, although e! + 0, the one exception, all other values of w are obtained infinitely 
often in a neighborhood of z > 0, as expected from Picard’s theorem. 

Singularities in f[z] at z = oo are classified according to the behavior of f[1] at z = 0. 
Thus, e* has an essential singularity at co, while z” is analytic at oo if n is a positive 
integer. 
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1.13.4 Poles and Residues 


Although the Laurent coefficients are defined in terms of an integral, it is usually easier to 
compute the coefficients using a derivative formula similar to that for the Taylor series. If 
fz] is analytic near z except for an isolated m-pole at zọ, we define an auxiliary function 


00 


gizl = @-%)"flel = >) a- (1.230) 


n=-=m 


that is analytic within |z — z)| < R where R is the radius of convergence for the Laurent 
series. The coefficients can then be obtained by differentiation, whereby 


1 grin 


a, = ——| 5 
" (m+n)! \dz"™ 


otal (1.231) 


2=2, 


0 


This result can be written more succinctly as 


grr) [Zo] 


ey (1.232) 


n 


where 6“ [zo] denotes the k'h derivative of [z] evaluated at Zo; This formula is similar to 
that for the Taylor series, except that f is replaced by ¢ and the index is shifted, and reduces 
to the Taylor coefficients for an analytic function with m = 0. However, this method is not 
useful at an essential singularity where m = oo. 

Often we require only the residue of f at z). For a simple pole we identify the residue 
as 


m= 1 = a = Olzo] = lim(< - a) fz] (1.233) 


while for an m-pole one obtains 


(m—1) 
m>l=>a]= oe (1.234) 


For example, consider 


n n 


z 
qiz] ad+bz+c 


Z 
Kj 


(1.235) 


fiz] = 


where we assume that n = 0, a + 0, and that a, b, c are real. (Other cases can be treated 
separately.) The two poles at the roots of the denominator 


-b+ Vb -4 
z = — (1.236) 


are distinct unless the discriminant b? — 4ac happens to vanish. We can then write 


-1,n =1,n 
fl = —— > p = EL (1.237) 


(Z — 2 )(Z -= 29) 4-1 
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where p; is the residue at pole z, and z; is the other pole. If there happens to be a double 
pole, we find 


d -b n-1 
Z =Z% HH > 4 = (Sez) aq ng e a'n(>) (1.238) 


Zo 
An important special case is provided by functions of the form 


fal = ra with glzol =0 q'Izo] + Oplzp) +0 (1.239) 


where the simple pole at zy is a zero of q[z]. The Taylor expansion of q[z] then takes the 
form 


qiz] ~ q’[zl(z — zo) (1.240) 
such that the residue of f[z] at z, becomes 


_ Piol 


a,=5 (1.241) 
A q [zol 
For example, the function 
fl = —— > 2, =Qk+ Dri, R, = -0% (1.242) 
e+] 
has poles at odd-integer multiples of zi with residues easily determined using q’[z,] = —1. 


1.14 Meromorphic Functions 
1.14.1 Pole Expansion 


If a function f[z] only has isolated singularities, it is described as meromorphic. For sim- 
plicity suppose that these singularities are simple poles at z,, where the index lists the poles 
in order of increasing distance from the origin. The behavior near a simple pole can be 
represented by z = z, = f[z] = 2 Thus, the function 


“n 


n 


b, 
gnc] = fil- > — (1.243) 


pak 


is analytic in a disk |z| < R„ where the radius R, encloses n poles. According to the Cauchy 
integral formula, we may write 


2 Leal, LASER $ ds 
all 5g De = og PS 2 c, (s= 2) (s —2;) a 


k=1 n 


where C, is a circle, |z| = R,,, that encloses n poles without any poles being on the contour 
itself. For any z + z, we can use partial fractions to express the second contribution in a 
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form 


ds ds ds 
$——-¢$ = 0 =0 (1.245) 
C, (s — z)(s — 24) C, ST Zk c S-Z 


n 


where cancellation between equal residues is apparent. Furthermore, if z = z, we also find 


d 
$ E (1.246) 
C, (s — Zg) 
and conclude that 
1 
8, |<] tts) ds (1.247) 


2mi Jo s-2z 


n 


for z within C,,. 
Next let 


M, = max| FÍR, e] | (1.248) 


represent the largest modulus found on the circle C,,, such that 


M,R 
<—44 1.24 
sdl = a (1.249) 


bounds g,,. If f is bounded such that R, > oo with finite M,,, we can construct a sequence 


of g, functions which are also bounded as |z| > oo. Thus, the function 
glz] = lim g,[z] (1.250) 


is analytic and bounded in the entire complex plane. According to Liouville’s theorem, 
such a function must be constant! Hence, we can write 


fld=8.+ >, bi (1.251) 
k=1 


Z7Zk 


and all that remains is to determine the value of the constant g,,. Using 
fI = 8 - >, = 8. = fll + ) + (1.252) 
ka *k k=l *k 


we finally obtain the Mittag—Leffler theorem. 


Theorem 17. Mittag—Leffler theorem: Suppose that the function flz] is analytic every- 
where except for isolated simple poles, is analytic at the origin, and that there exists a 
sequence of circles {C, : |z| = R, k = 1, n} where each C, encloses k poles within radius 
R, Furthermore, assume that on these circles |f| is bounded as R, > œ. The function can 
then be expanded in the form 


Sf b b 
flz] = f[0] + (+ + 2) (1.253) 
2 LZ, Z 


n 


where b,, is the residue for pole z,,. 
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Unlike Laurent expansions for which the choice of z) can be somewhat arbitrary, the 
pole expansion for meromorphic functions depends only upon intrinsic properties of the 
function itself. Although the present version places significant restrictions on the function, 
generalizations can often be made fairly easily. For example, if f[z] has a pole at the origin, 
one can apply the theorem to the closely related function g[z] = f[z + Z)] where z, is any 
convenient point where f[z] is analytic. Similarly, if M, œ R”*! for large R,,, one can 
employ an expansion of the form 


m k 00 b m+l 
fiz] = AR +) a (: ) (1.254) 
k=l j n n 


n=1 


Poles of higher order can be accommodated also, but we forego detailed analysis here. 

Pole expansions appear in many branches of physics. If f[z] represents the response 
of a dynamical system to some driving force, the poles generally represent resonances or 
normal modes of vibration while the residues represent the coupling of the driving force 
to those normal modes. Pole expansions can also be used to sum infinite series. 


1.14.2 Example: Tan[z] 


The function Tan[z] has simple poles at z, = (n + Dr with residue b, = —1 for integer n, 
both positive and negative. Thus, circles C, of radius R, = nz enclose 2n poles without 
singularities on the contours. One can show that M, > 1 as n > oo. Hence, Tan[z] fulfills 
all requirements for application of the Mittag—Leffler theorem. The pole expansion can 
now be expressed as 


Tan[z] = =i -(n+3)n i (n+ 5) (1.255) 
e = dz 
a OTE ATEN) (1.256) 
such that 
Tan[z] = Y” E — 


With the substitution z > s7/2, we obtain the partial fraction representation 


ST 1 1 1 
T = + + ee 1.258 
sa ia oe a ome 
Expressions of this type can often be used to sum infinite series. For example, from 

2 
MT ST] N 

lim = Tan| =| = = (1.259) 
one immediately obtains 

= iY r 
= — 1.2 
3 (5 + r) 8 ee 


k=0 
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Then using 


Alan) > de SO] (1.261) 


Me 
| ea 
ll 
= 
4 
M: 
== 
Ro 
—e, 
4 


2 1 T 
> a 1.262 
K ( ) 


1.14.3 Product Expansion 


If f[z] is an entire function, its logarithmic derivative é[z] = f’[z]/f[z] is a meromorphic 
function with poles at the roots of f[z]. If these roots are simple, the corresponding poles 
in ¢ will also be simple. Near a simple root we express f[z] in the form 


22% 2, => fil = @-z,)p,[z] = lz] ~ (1.263) 


L-%, 


where p,[z] is smooth and nonvanishing near z,,. Hence, the poles of the logarithmic deriv- 


ative all have residue b, = 1. Provided that ¢ is suitably bounded at co, we can now use 
the pole expansion of ¢ to write 


dLoglf] _ = 1 i 
dz = +2 +2] ets 
= Loglflz1] - Log[f[0]] = 26, + > (tet — Za] — Log[-z,,] + z) (1.265) 
n=1 n 


where ġo = f’[0]/f[0]. Exponentiating and simplifying this expression, we obtain the 
product expansion 


fiz PIO, N we 
for E. Ao (LLU de" 12 
FO) so 110] | Ml 3 je (1.266) 


n 


where the z, are the roots of f[z]. Like the pole expansion of meromorphic functions, 
the product expansion of entire functions depends only upon intrinsic properties of the 
function. Expansions of this type are often useful in symbolic manipulations, but generally 
converge too slowly to be useful for numerical evaluations. 


1.14.4 Example: Sin[z] 


Although Sin[z] is an entire function with simple poles at z, = +z, we cannot employ 
the product expansion directly because $, is not finite. However, this difficulty is easily 
circumvented by considering instead the function 


fa Sin[z] 
zZ 


= gizl = Cote] - È, 910] =0 (1.267) 
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The positive and negative roots can be accommodated by using two products 


sal _ ñ (1 - De [Ih A er (1.268) 


n=1 n=1 


and combining factors pairwise to obtain 


mE = T] (: - ()) => Sin[z] = {Il - ()) (1.269) 


n=1 


This form displays all the roots of Sin[z] and is, in effect, a completely factored represen- 
tation of its Taylor series. 


Problems for Chapter 1 


1. Complex number field 

In mathematics, a field F is defined as a set containing at least two elements on which 
two binary operations, denoted addition (+) and multiplication (x) satisfy the following 
conditions: 

completeness and uniqueness of addition: Ya, b € F, c = a + b € F is unique 
commutative law of addition: a + b = b + a 

associative law of addition: (a+ b)+c =a+ (b+c) 

a+c=b+c=a=b 

existence of identity element for addition: Ya, b € F, dx ə a+x = b => J0 3 a+0 =a 
completeness and uniqueness of multiplication: Ya, b € F, c = a X b € F is unique 
commutative law of multiplication: a x b = b x a 


associative law of multiplication: (a X b) X c = a x (b x c) 


ARAS DAA eS 


axc=bxcAc+0=>a=b 


a 
© 


. existence of identity element for multiplication: Ya, b € F, Ix + 0 3 a X x = b => 
dlaaxl=a 


11. distributive law: ax (b+c) =axb+axc 


The real numbers R obviously form a field with respect to ordinary addition and multi- 
plication, but it is not immediately obvious that the complex numbers C form a field with 
respect to the extended definitions of addition and multiplication. To demonstrate that C 
is a field, you must identify the identity elements for addition and multiplication and must 
verify that each of the 11 conditions set forth above is satisfied. 


2. Triangle inequalities 
Prove the triangle inequalities: |Iz,| — lz,Il < lz, + z,1 < lz,1 + lz,1. 
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3. Applications of de Moivre’s theorem 


Show that 
Cos[n6] = Cos[6]" — (3) Cos[6]"~? Sin[0]? + 4 Cos[0]""* Sin[0]* +... (1.270) 
Sin[n6] = (i) Cos[0]"7! Sin[6] — a Cos[0]""? Sin[0]? +... (1.271) 


4. Lagrange” trigonometric identity 
Prove: 


n . 1 
Y” Cos{k6] = ave Sin|(n = 2) 0) (1.272) 
k=0 2 2 Sin| $] 


Hint: first prove 


Ya _ 1-2 (1.273) 


5. Quadratic formula 
a) Prove that 


(b? — 4ac)? —b 
2a 


applies even when a, b, c are complex. Why did we not use a + sign in front of the 
square root? 


ad+bi+rc=0=>2= (1.274) 


b) Use the quadratic formula to determine all roots of the equation Sin[z] == 2. (Hint: 
Sin[z] = (w — +) where w = e.) 


6. Assorted trigonometric equations with complex solutions 

Find all solutions to the following equations assuming that a, b are real numbers and that 
lal > 1, lb] > 1. Express your results in the form z = x + iy where x, y are real-valued 
expressions that do not involve trigonometric functions and be sure to consider all cases. 


a) Cos[z] = a 
b) Cos[z] == bi 


7. Series RLC circuit 

A circuit contains resistance R, inductance L, and capacitance C in series with a generator 
of electromotive force &[t] = €, Cos[wt]. Let J[t] represent the current flowing in the 
circuit and Q[t] the charge stored in the capacitor. It is useful to express the physical 
quantities 


E[r] = Re[Ee], It] = Relfe], Qt] = Re[Oe*”] (1.275) 


in terms of complex phasors E, Í, and Q that represent both the magnitudes and relative 
phases for sinusoidal time dependencies. 
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a) Use Kirchhoff’s laws to derive a phasor generalization of Ohm’s law, Ê = 1Z, where 
the impedance Ê = Ze*? is generally complex. Express the modulus, Z, and the phase, 
$, of the complex impedance in terms of the real parameters of the circuit. 

b) Show that the power averaged over a cycle is given by P = - Re[7€*] and evaluate 
this quantity in terms of real parameters. Show that P[w] exhibits a resonance and 
determine its position and full width at half maximum (FWHM). Sketch P[w] and 
olw] together. 


8. Smith chart 
The complex impedance Z = R + iX for an AC circuit is decomposed into resistive and 
reactive components, R and X, where R > 0 and —œ < X < oo. Smith proposed a repre- 
sentation 

Z-1 


= iy = == 1.27 
W =u+iv 771 ( 6) 


that maps the right half-plane for Z onto the unit disk for W. Determine the mappings for 
lines of constant R and lines of constant X. Sketch illustrative samples of each. 


9. Bilinear mapping 
Study the bilinear mapping 


az+b 


=——, ad-bc+0 (1.277) 
cz+d 
by determining the images in the w-plane of representative lines and circles in the z-plane. 


10. Component functions 
Develop explicit expressions for the real and imaginary components, u[x, y] and v[x, y] for 
the following functions of z = x + iy. 


a) f[zl=(?-1)'? 
b) ela = z- 126 + 1)!” 


11. Inverse trigonometric functions 


Prove: 
arcsin[z] = —i logliz +(1-2)” i (1.278) 
arccos[z] = —i log|z +(2-1)" i (1.279) 


— (1.280) 


arctan[z] = E log 

2 i-z 
This can be done by expressing equations of the form z == Sin[f] in exponential form, 
substituting w = e*f, solving for w, and deducing f[z]. Determine the branch cuts needed 


to specify the principal branch of each function. 


12. An identity 
Prove: ArcTan| +5 | = 2 ArcCot[z] 
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13. Principal value for an imaginary power 


Suppose that 
ia — 1)" 
= 1.281 
i E + i ( ) 


where a, b are real. 


a) Show that this quantity is real and find a simple expression for its principal value. 


b) Determine the position and magnitude of any discontinuities. 


14. Derivative wrt z* 

Show that a function f[x, y] of two real variables can be expressed as a function glz, z*] 
of the complex variable z = x + iy and its complex conjugate z* = x — iy. Then show 
that the requirement 0g/0z* == 0 is equivalent to the Cauchy—Riemann equations for the 
components of f and argue that an analytic function is truly a function of a single complex 
variable, instead of two real variables. 


15. Analyticity of conjugate functions 
Suppose that f[z] is analytic in some region. 


a) Under what conditions is g[z] = f[z*] analytic in the same region? 
b) Under what conditions is h[z] = f[z]* analytic? 


c) Under what conditions is w[z] = f[z*]* analytic? 


16. Completion of analytic functions 
Which of the following functions u[x, y] are the real parts of an analytic function f[z] with 
z = x + iy? If u[x, y] = Re f[z], determine f[z]. 


a) u=x—-y 
b) u=x -y +y 


17. Analyticity for the sum, product, quotient, or composition of two functions 
Suppose that filz] = u,[x, y]+i v, [x y] and f,[z] = u,[x, y]+iv,[x, y] are analytic functions 
of z = x+iy. Show that fi + f2, fit. fi fh and f,[f,[z]] are analytic functions under appro- 
priate conditions by demonstrating consistency with the Cauchy—Riemann equations. Be 
sure to specify the requisite conditions for each case. 


18. Equipotentials and streamlines for exponential function 
Sketch the equipotentials u[x, y] and streamlines v[x, y] for w = e? where z = x + iy and 
w=urtiy. 


19. Equipotentials and streamlines for Tanh 
Evaluate and sketch the equipotentials and streamlines for the hyperbolic tangent. 


20. Cauchy-Riemamn equations in polar form 


Suppose that z = x + iy = re“ is expressed in polar form and let f[z] = Re’ where R[r, 6] 


and Ofr, 6] are real functions of r and 6. Derive Cauchy—Riemann equations relating R to 
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ue and E to ue for differentiable functions. (Hint: consider infinitesimal displacements 


dz, and dz, in the ? and Ê directions.) 


21. Circular average of analytic function 
Demonstrate that if f[z] is analytic within the disk |z — zol < R then the average value of 
fl% + re] on any circle |z — Zol = r < R is equal to the value at its center, f[z¿]. 


22. Maximum modulus principle 

Prove that, if f[z] is analytic and not constant within a region R, then |f[z]| does not have 
a maximum within the interior of R. Hence, if f is analytic and not constant within R, |f| 
must reach its maximum value on the boundary of R. 


23. Extrema of harmonic functions 

Suppose that u[x, y] is harmonic and not constant within region R. Prove that u[x, y] has 
no extrema (neither maximum nor minimum) within R; hence, its extrema must be found 
on the boundary of R. [Hint: apply the maximum modulus principle to ef! where f is 
analytic within R.] 


24. Absence of extrema in |f| for analytic functions 

If F[z] is analytic in domain D, demonstrate that |f[z]| has no extrema in D. Hint: use 
the Taylor series representation to show that no neighborhood |z — z)| < r contains an 
extremum. 


25. An application of the Cauchy integral formula 
Suppose that f[z] is analytic on and within the simple closed positive contour C. Evaluate 
the following integrals. 


a) 4. 48 dt 


Ini SC e-z 


b) 1 Pre fit dí 


ini Pe Pz 


26. Derivatives of analytic functions 
Assume that f[z] is analytic on and within a positive simple closed contour C that encloses z. 
Use induction to prove 


JD + Az- fe] al fa fiw] 
Az = mi h” 


fP] = lim (1.282) 


where f is the n' derivative of f. 


27. Fundamental theorem of integral calculus 
Prove the fundamental theorem of integral calculus: If f[z] is analytic in a simply con- 
nected domain D that includes z, and z, then 


F[z] = f l fidt (1.283) 


is also analytic in D and f[z] = dF[z]/ dz. 


60 1 Analytic Functions 


28. Poisson integral formula 
a) Suppose that f[z] is analytic within the disk |z| = r < a. Prove 


1 
fiz = gE z a (1.284) 
2ni Jels-=2 s-£ 
where C is a circle of radius a centered on the origin. Then deduce the Poisson integral 
formula 
; 22 27 ib 
f [re] = E f NERE dé (1.285) 
2r o a +r —2arCos[¢ — 0] 


b) Suppose that we know the electrostatic potential y on the surface of a long cylinder as 
the real part of an analytic function (consider only two spatial dimensions). Obtain a 
general formula for the potential at any point within the cylinder. Compare the poten- 
tial at the origin with the mean-value theorem. Note that, although a formal proof is 
not required, the Poisson integral formula can be applied for any function that is har- 
monic within C except for a finite number of jump discontinuities upon C. 

c) As a specific illustration, compute the interior potential given that y[ae'?] has the 
constant value V for $, < f < @, and is zero on the rest of the cylinder. Display 
the angular dependence for a representative selection of r values for some choice of 


hh. 
29. Uniform convergence of power series 


Suppose that the power series f [z] = Yj) a,z* converges absolutely such that f[z] = 
lim falz] for |z| < R. Show that f [z] converges uniformly in any subdisk |z| < B < R. 


n>00 


30. Convergence of series representation for e* 

Demonstrate explicitly that the series Y, 2*/k! is absolutely convergent for all z and that 
it is uniformly convergent in |z| < R for any finite R. Can one properly claim uniform 
convergence for all z? 


31. Sharpened ratio test 
a) The integral test can be used to established absolute convergence of the series repre- 
sentation of the Riemann zeta function 


“gle > a (1.286) 
n=1 
when Re[z] > 1. Use the Weierstrass theorem to prove that ¿[z] is analytic for Re[z] > 
1. (This function has an important role in number theory and often appears in theoret- 
ical physics. It can be extended to most of the complex plane by analytic continuation, 
but that is beyond the scope of this problem.) 
b) Use this result to obtain a sharpened form of the ratio test that states when the ratio of 
successive terms takes the form 
an+ 


an 


aj = (1.287) 


n 


for large n, the series converges absolutely if s > 1. 
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c) Prove the existence of Euler’s constant 


y = lim È I = Lot (1.288) 


n> 
k=1 


32. Laurent series 
For each of the following functions, construct a complete set of Laurent series about the 
specified point and specify their convergence regions. 
= 1 = 
a) fk]= EDED about z = 0 
b) fla = A about z = 2 


c) fk]= gp about z = 0 


d) fiz] = Sin[z + 4] about z = 0 


33. Laurent expansion for z? Log| | 


a) Define a single-valued branch for 


fld=2 Log| ¡| (1.289) 


and specify the region where your definition is real. 
b) What is the nature of the singularity at infinity? 


c) Construct a Laurent expansion for |z| > 1. 


34. Some trigonometric series based upon a Laurent series 
Evaluate the Laurent series for (z — a)! where —1 < a < 1 in the region |z| > a. Then use 
z > e" to compute 2-1 a” Cos[mé] and >), 4” Sin[mé]. 


m= 


35. Grounded cylinder normal to uniform external field 

a) Suppose that an infinitely long conducting cylinder of radius a is grounded. The cylin- 
der is subjected to a uniform external electric field directed perpendicular to its sym- 
metry axis. Use an analytic function to evaluate and sketch the equipotential surfaces 
and the net electric field. (Hint: expand ®[z] = ¢[z] + iW[z] as a Laurent series around 
the origin and determine the coefficients using the appropriate boundary conditions.) 


b) A two-dimensional incompressible fluid flows around an infinite cylinder whose axis 
is normal to the plane of motion. At large distances the velocity field is uniform. 
Evaluate and sketch the streamlines near the cylinder using an analytic function. 


36. Isolated singularities 
Classify the isolated singularities for each of the following functions. Be sure to consider 
the point at oo, using z > 1/w with w > 0. 
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J= 
b) Cos[z] 
Z 
c) ze!” 
eË 
d Ss 
zZ +A? 


37. Singularity sequence 
Identify and classify the singularities of 


1 
2e 1.290 
fll = ra a 
Is the singularity at the origin isolated? Is it a branch point? 
38. Residues 
Locate the poles for each of the following functions and evaluate their residues. 


z+1 
a) Scan 
z“(z + 21) 
b) Tanh[z] 
e 
c 
) 24 mr 
1 
d ——— int 
) FED (integer n) 


39. Pole expansions 
Develop pole expansions for the following functions, being sure to verify that the necessary 
conditions are satisfied. 


a) Cot[z] 

b) Csc[z] 

40. Product expansions 

Develop product expansions for the following functions, being sure to verify that the nec- 
essary conditions are satisfied. 

a) Coslz] 

b) Sinh[z] 

41. Product expansion for even functions 

Suppose that f[z] is entire and is even, such that f[-z] = f[z], and that its roots are all 


simple. Also assume that, except for simple poles, its logarithmic derivative is bounded at 
infinity such that the product expansion of f[z] converges. 


a) Show that the product expansion can be expressed in the form 


fiz] = F10] Mb - (2) (1.291) 
n=1 n 


where f[+z,] = O and where the product includes only one member of each pair of 
roots. 
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b) Show that 


Oo 
SES 1.292 
fo) 42 aa 
FOO] _ (£710) 
= 3 - 12 = 1.293 
Am ) 2 ¿ hs 


c) Apply these results to f[z] = Sin[z]/z and evaluate the following sums: 


` n? y nt (1.294) 


42. Contour integration of logarithmic derivative 
Suppose that f[z] is analytic within a domain D containing the positive simple closed 
contour C. The function ¢[z] = f’[z]/f[z] is known as the logarithmic derivative of f. Let 


= mi T olz] dz (1.295) 


a) Suppose that z, is the only zero of f within D and is of order m. Show that J = m if C 
encloses Zp. 


b) Evaluate Z assuming that f’[z] + O in D and that C encloses N roots of f but that 
Fl] #0 onc. 


43. Argument principle 

a) Suppose that f[z] is analytic and nonzero on the positive simple closed contour C 
and that it is meromorphic in the domain D contained within C. The function ¢[z] = 
f'l fiz] is known as the logarithmic derivative of f. Prove that 


af blz] dz = N) - (1.296) 


where N, is the number of zeros and N, is the number of poles in D where each 
accounts for multiplicity (e.g., a double root or double pole counts twice). 


b) Show that 
fot dz = iAç arg[ f] = 27i(No — N,) (1.297) 
c 
measures the change in the argument of f [z] as z moves around C. (Hint: consider the 
image C > T under the mapping w = f[z].) 


44. Rouchés theorem 
Prove that if f[z] and g[z] are both analytic on and within the simple closed contour C and 
le[z]l < |f[z]l on C, then f[z] and f[z] + g[z] have the same number of zeros within C. 


2 Integration 


Abstract. Contour integration provides very powerful methods for evaluating inte- 
grals. We also consider several useful tricks that are more elementary but sometimes 
unfamiliar. Finally, integral representations are introduced for a variety of functions. 


2.1 Introduction 


Integration in closed form is rapidly becoming a lost art. Unlike differentiation, whose 
clear rules permit direct if tedious evaluation, integration often relies upon trial and error 
with many dead ends. In the heroic era of theoretical physics, cleverness in changing vari- 
ables or choosing contours was revered, but now practically any integral that can be done 
in closed form may be found in standard compilations, such as Gradshteyn and Ryzhik, or 
in mathematical software, such as MATHEMATICA’. Although some skill in symbolic integra- 
tion is still needed for traditional examinations, for most physicists its usefulness beyond 
the PhD qualifier examination is rather limited, unless you happen to be teaching a course 
that still relies on the methodology of the nineteenth century. Nevertheless, compilations 
are not entirely complete and software packages are not perfect. Furthermore, traditional 
integration methods remain useful for developing insightful approximations to integrals 
that cannot be evaluated fully in closed form. Numerical methods provide answers but 
limited insight. Therefore, in this chapter we briefly present some of the most useful sym- 
bolic techniques. We also discuss integral representations of analytic functions. 


2.2 Good Tricks 


Presumably integration by parts and variable transformations are too familiar to merit 
discussion here, but there are several other elementary methods that are quite valuable 
but with which students are generally less familiar. 


2.2.1 Parametric Differentiation 


Often when one integral is known, an entire family of related integrals can be developed 
by differentiating with respect to a parameter in the integrand. For example, given 


hlà] = IN e dx = I (2.1) 


0 


the entire family 


Te Go ON n! 
LIA -f e “d;=(-7) hA = w (2.2) 
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becomes available. If one needs to evaluate an integral, like f = xe dx, that is pre- 
sented without a parameter, simply insert one to obtain 


00 E av fr Gn-D1 
2n y Ax? = _ 
fo q (- a) AE ~ gnynts (2.3) 


and then set A > 1. It might surprise you how often this trick is helpful. 
Notice that if a parameter appears in the limits of integration, one must also include 
the variation of these limits using 


ka 
an 


bLA] 
fix, Alda = pa ai l ax x+ fb, ye - fia Ta (2.4) 
alA] a Or 


with the rhs evaluated for the appropriate A. 


2.2.2 Convergence Factors 


Sometimes when it is not obvious whether the integral of an oscillatory function over an 
infinite range will converge to a definite value, application of a convergence factor may 
help resolve the question. For example, it is not obvious, at least to this author, whether 
5 Sin[kx] dx converges. Consider instead 


-Àx Q; —Ax „ikx 
f e Sin[kx] dx= in| | e e dx] = m EVI e (Q.5) 


which does converge for A > 0. The desired integral is then obtained from the limit A = 0, 
whereby 


f Sin[kx] dx = lim { e™ Sin[kx] dx = + (2.6) 
0 130 Jo k 


Admittedly, this result does appear somewhat arbitrary and some skepticism is justified. 
However, if this integral were encountered in a physics problem, it probably would arise 
from a limiting process anyway. Either a spatial or temporal variable should be limited 
to a finite range or a damping mechanism should be present that ensures convergence. 
One should then retreat a few steps in the derivation, identify the appropriate convergence 
factor, and evaluate the integral before that convergence factor is lost from view. 


2.3 Contour Integration 


2.3.1 Residue Theorem 


Suppose that f[z] is analytic throughout a domain D except for isolated singularities 
(poles) and that a simple closed contour C within D encircles poles {z,,k = 1, N} with 
residues R,. By deforming the contour to encircle each pole, we obtain 


N 
dra dz= 9, Q flldz (2.7) 
c k=1 Y Ck 
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where each Ç, is a small circle encompassing pole k. Near each z, we can employ a Laurent 
expansion 


Flz] = 3 apez- %) = ¢ Flz] dz = y an Q (z-z) dz (2.8) 


n=-My n=-My 


to evaluate its contribution to the contour integral. Using the now familiar circular contour 
integration with z — z, = pe” => dz = pei d0, we obtain 


2r 
$ (2-2) dz = ip™! f ef) do = 2riô, _, 
E 0 


> $ fiz] dz = 2ria_; , = 27iR, 
C, 


k 


(2.9) 


Therefore, we obtain the residue theorem: 


Theorem 18. Residue theorem: If f(z] is analytic on and within a simple closed counter- 
clockwise contour C, except for interior poles {z,, k = 1, N} with residues R,, then 


N 
VHG dz = 2ni Y Ry 
C k=1 


This is an amazingly powerful theorem that can be used, with clever choices of contour, 
to evaluate a wide variety of definite integrals which might be very difficult by means of 
familiar antidifferentiation methods. The trick is to find a simple closed contour that con- 
tains the desired integral on one portion of the path with easier integrals on the remainder 
of the path (if any). The examples in following subsections will demonstrate that contour 
integration using the residue theorem provides some of the most versatile methods for 
evaluating definite integrals. 


2.3.2 Definite Integrals of the Form la fisin 9, cos 0] dé 


We assume that f[sin 0, cos 6] can be represented by a single-valued function of z = e” in 
the relevant region of the complex plane. Often f is a rational function of sin 0 and cos 0. 
Then we use 


d et zi 
z=e = d0 =-i, sing =* > - Capa E (2.10) 
zZ 
such that 
2m ol —1 d 
f[Sin[6], Cos[0]] do = E a | s 
2i 2 Z (2.11) 


= 27 J residues within unit circle 


where the contour is the unit circle about the origin. Special handling is needed if any of 
the singularities of f are on the unit circle. 
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Figure 2.1. Poles in the integrand from Eq. (2.12). Left: Im[a] = 0. Right: Im[a] > 0. 


2.3.2.1 Example: 1[a] = f; on de 


0 


Although this integral is not stated in the desired form, the integrand is even. Hence, by 
direct application of the recipe above, we find 


Iļja] T EA $ : d (2.12) 
a] = = “Ata Y ——————_y, dz ; 
2 Jo a? + Sinf? 422 — (2 -1f 


The denominator is a quadratic in z?, so that one obtains four poles at 


denominator = 4a*z? - (z? = 1); 


poles = z/ . Solve[denominator == 0, z] 


lua - Vita’,a-Vita’,-a+Vita’,at Viral] 


Evaluation of the residues 


= ,Tz, #| &, poles] 


Map [Residue A 
denominator 


1 1 al 1 
l 8aV1 +a? 8aV1+a? SaVi+al. | 
is straightforward, even by hand. If a > 0 the pair of poles at + (a =v1+ a’) is inside the 


unit circle while the other pair is outside, while if a < 0 the reverse is true. In either case 
we have two equal contributions, such that 


Ia] = —— 2.13 
[a] VET. (2.13) 
Figure 2.1 illustrates the positions of the poles relative to the unit circle. The left figure 
uses a real value for a, while the right figure uses a complex value. Although one typically 
assumes that parameters in definite integrals are real, the method is more general and does 
not require that assumption. 
Alternatively, if we use a trigonometric identity to express the integrand as 


1 - Cos [20] 


= f Sin[0]? > ——— / / Simpli fy 


a? + Sin [0]? 
2a 
1+ 2a? -Cos[20] 
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we obtain a contour integral with a quadratic denominator 


e 

LT a 1 
masl = di d 2.14 
Lal i 1 + 2a? — Cos|[8] dara í ae 


for which evaluation of the residues is easier by hand. 


denominator = z? - 2 (2a? + 1) z+1; 


poles =z/. Solve [denominator == 0, z] 
la +22? -2aV1+a?, 1+2a?+2aV1 val 


Map [Residue | ¿tz "| &, poles] 


denominator 


1 al 
l 4aV1 +a? ? J! 
Notice that there are only two poles in the transformed function because the angular vari- 
able was replaced by 6 > 0/2. Recognizing that for either sign of a just one of the poles is 
within the unit circle, one obtains the same final result. 


2.3.3 Definite Integrals of the Form f > fix]dx 


We assume that f[z] is analytic except for isolated singularities and vanishes faster than 
z7! for r > œ in either half-plane. With these conditions we can employ a semicircular 
contour of radius R => co closed in the appropriate half-plane to obtain 


f fix] dx = $ fiz]dz = 27.4 y residues in half-plane (2.15) 


To prove this result, suppose that f[z] is bounded in the upper half-plane such that 


|f [Re"]| = MR" = | f E [Re*] ad < MR “TR (2.16) 
0 


where M is a positive real number. Then 


a> 1= lim If f [Re] do| < lim xMR'"" = 0 
0 (2.17) 


R 
= lim I fix] dx = 2ni ) residues in half-plane 
R=% -R 


ensures that if f falls fast enough we need only evaluate the residues at isolated singulari- 
ties of the analytic function f[z] in the appropriate half-plane. Therefore, we find 


Jim R|f [Re] 


R 
=0= lim f fix]dx = 2ri y residues in half-plane (2.18) 
R>00 -R 


using a great semicircle. 
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2.3.3.1 Example: /[a] = [" 1; dx 


The integral 
e 1 1 
dx =  — dz 2.19 
IN 1+x4 $ 1+ ( ) 
can be evaluated using a semicircular contour in either half-plane. The integrand has poles 


at z, = Explir2"] for k = 1, 4 of which two are found in each half-plane. The residues 
can be evaluated using 
1 1 1 Í aa; 1+ 
HMa===>R=>=3=10%%*=-—H (2.20) 
qlz] i q [ze 4 4 4V2 


Thus, we obtain 


A 1 1+1% m 
ax= h dz= 2il- sy 2.21) 
La c1+2 ral V2 


2.3.4 Fourier Integrals 


Consider a Fourier integral of the form 


a i de dx (2.22) 


where k is a positive real number. Integrals of this type can often be evaluated by extending 
f to the complex plane and using a great semicircle in the upper half-plane, provided that 
the contribution of the return path 


Ip[k] = iR f i f[Re**] Exp[-kR(Sin[6] — i Cos[6])] 40 (2.23) 
0 


vanishes in the limit R > oo. Suppose that Mp is the maximum modulus of f on this arc, 
such that 


|Z| < RMg f Exp[—&R Sin[6]] 49 (2.24) 
0 
Dividing this latter result into two equal contributions now gives 
m/2 
[Ip] < 2RMp f Exp[—&R Sin[6]] d0 (2.25) 
0 


Figure 2.2 illustrates that Sin[6] > 26/7 on this interval. Thus, the integral on the great 
semicircle is limited by 


m/2 
[In| < 2RMp il Exp[-2kR0/1] d0 (2.26) 
0 


or 


= eE 


IZr| < T™Mr k 


(2.27) 
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1 
0.8 
0.6 
---- 20 /n 
0.4 
— Sin[6] 
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Figure 2.2. 


This result is known as Jordan’s lemma. Therefore, if k > 0 and if f[z] vanishes on an 
infinite semicircle, such that 


lim Mz = 0 = |R| = 0 (2.28) 


the contribution of the return path vanishes and we may evaluate the Fourier integral using 


jim | f [Re"] =0 => f flxle™ dx = 2ni E residues of integrand in half-plane 


(2.29) 


If k happens to be a negative real number, we close in the lower half-plane instead and 
obtain the same result. Notice that this condition upon f is less restrictive than in the 
previous section due to the presence of the exponential factor, which is damping in the 
appropriate half-plane. However, convergence for k = 0 requires limp ,., RIF [Re*%]] = 0 as 
before. 


2.3.4.1 Example: f > Sak de 


Xx" +a 
Consider the integral 


Ak] = f E (2.30) 
0 


Ltg 


where k and a are positive real numbers. Although this integral is not presented in the 
desired form, it is simply half the real part of 


= œ Explikx] Exp[ikz] = 1 z 

¿lk] = f A dx = $ PY dz => FU = = Relik] (2.31) 
-o X +a z +a 2 

and may be evaluated using a great semicircle in the upper half-plane, wherein lies one 

simple pole at z = ia. Therefore, we obtain 


alk] = 2ni 


Exp[-ka] ne 
2 2.32 
zj (2.32) 


= fik] = a 


without further ado. The result is actually more general than this derivation — it applies 
equally well for complex a provided only that Re[a] + 0 to ensure that the poles are not on 
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the real axis. Be alert to generalizations! Having expended some effort to obtain a result, 
it is good practice to extend it to the most general conditions possible. Also notice that 
one cannot use Cos[az]/(2? + a?) directly because Cos[az] is not bounded on the great 
semicircle — it diverges exponentially for large Im[z] in both upper and lower half-planes. 


2.3.5 Custom Contours 


Sometimes it is necessary to design a contour which exploits specific characteristics of 
. . . . ‘CO 2 
the integrand. For example, when previously evaluating the integral Í Cos[x”] dx we 
employed an arc subtending 7/4 radians. Unfortunately, there are no general rules to guide 
one toward the optimum contour for an arbitrary integrand; one must rely on intuition and 
experience to minimize the amount of trial-and-error in choosing such contours. Below we 
give just one more example of a custom contour. 
Consider the integral 


t=[ a O<a<1 (2.33) 
The integral on the real axis converges because the integrand is of order e“* for x => —oo 
or Exp[(a — 1)x] for x > oo and decreases exponentially in either limit when 0 < a < 1. 
However, the function 


FI] = = 2, = (2k + 1)ri (2.34) 


e 
e+) 
with simple poles on the imaginary axis at odd-integer multiples of 7 does not satisfy the 
conditions needed to employ a great semicircle. Fortunately, the contour integral 


R 277 =R 
È fcldo= jim ( f fods f fiRdy+ f fi 2mdx 


0 
-f ME ay) (2.35) 


can be evaluated fairly easily using the rectangular strip, (~œ < x < œ,0 < y < 27), 
shown in Fig. 2.3. This contour encloses a single pole at z, = ia with residue —e'“, such 
that 


$ f[zl dz = -2rie™ (2.36) 
The contributions from vertical segments 
. o EplaR tiy) _ ,. M 
Jim SIR») = lim gor rpi T Jim Bola — DR] =0 (2.37) 
: a Expla(-R+iy)] _ ,. 7 
RSQ CRD = Be Expl-R + iy] + 1 PREO ee 
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Figure 2.3. Rectangular contour used for Eq. (2.33). 


vanish in the limit R > oo, while the horizontal segments are related by 


_ Expla% + 27i)] _ i 
fix 27)] = Epke Exp[2ria] f[(x, 0)] (2.39) 
such that 
e — paña = T 
$ fizldz = (1 - e™)I = I = Sintra] (2.40) 


This result actually finds somewhat broader applicability because the contributions 
from the vertical segments vanish provided only that O < Re[a] < 1. Therefore, we can 
allow a to be complex and find 


oo eX T 
dx= f R 1 2.41 
E Pal” Sag PRES Gah) 


As always, be alert for possible generalizations. 


2.4 Isolated Singularities on the Contour 
2.4.1 Removable Singularity 


Often one encounters isolated singularities on the integration path. For example, the inte- 
gral 


I= f SSA (2.42) 


Xx 


00 


is important in Fourier analysis. The integrand has a removable singularity at the origin, 
but that would not cause any difficulty if the integral remained in this form. However, 
because Sin[z] is divergent as y > co, we would prefer to evaluate 


fin f € dx (2.43) 
eae 


by closing the contour in the upper half-plane. The penalty for this transformation is 
that the singularity at the origin is no longer removable. Fortunately, that problem can 
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de 


Figure 2.4. Great semicircle with small detour that excludes a removable singularity at the origin. 


be avoided by also making a small semicircular detour around the origin as sketched in 
Fig. 2.4. According to Cauchy’s theorem, the integral 


$ f dad (2.44) 


Z 


vanishes on this contour because no singularities are enclosed. The contribution of the 
great semicircle vanishes because the integrand satisfies the requirements of Jordan’s 
lemma. The linear segments 


im( f 7 fix]dx + f > fix] ds) (2.45) 


converge to the desired integral in the limit e > O because the integrand is well-behaved 
on the real axis. The contribution of the small semicircle of radius £ > 0 is evaluated using 


0 0 
di=id0=i | f[zl2d0 = i lim f Expļise”®] d0 = —in (2.46) 
Therefore, we find 


f TPU sg (2.47) 


a x 


Notice that the contribution of the semicircular detour is mi times the residue of the 
integrand, half the value we would have obtained from the residue theorem for a complete 
circle around the singularity. More generally, if f[z] is analytic at zo, the detour integral 


dz = f [zo] i40 (2.48) 


is proportional to the angle subtended. This result is easily proven by expanding f[z] 
around zgo. Also, notice that A@ is positive for counterclockwise or negative for clockwise 
detours. 
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2.4.2 Cauchy Principal Value 


An improper integral whose integrand is singular at one of the endpoints of the integration 
range is defined by one of the limits 


b b-€ 


b b 
f fl dx = lim f flxldx or fixldx = lim fix]dx (2.49) 


However, if an isolated singularity lies within the range of integration then two limits are 
needed 


Cc b=e Cc 
f fx] dx = lim ' flex + lim f flx]dx (2.50) 
a EY Ja £27" Y bre, 


and often there will be no unique value if the two limits are taken independently. For 
example, applying this method to 


ta “id ta 
x Í X : x : e 
— = lim — + lim == lim log[= (2.51) 
-1 A €,20 J_1 x €)70 se, * £,>0,£,>0 €, 
does not provide an unambiguous result unless one decides to approach the singularity in 


a symmetric manner, such that e, = e). This particular value is designated the Cauchy 
principal value of the integral and denoted by P f , such that 


c b-e ë 
pf fix] dx = in ( Sx] dx + f fix] ds) (2.52) 
a e a b+e 
Hence, for the example above we find 
1 
d. 
al (2.53) 
-1 x 


If there are several isolated singularities on the contour, the Cauchy principal value treats 
each symmetrically. 
The Cauchy principal value is based upon antisymmetric behavior near a simple pole, 
where we can write 
a 


fal = — + elz] (2.54) 
g= Zo 


with g[z] analytic near zọ. As illustrated by Fig. 2.5, the two divergent contributions on 
either side cancel, leaving behind the background contribution g. Of course, this cancella- 
tion does not work for a double pole with symmetric divergence. 


2.4.2.1 Example: P [S Bl dx 


ax 
Consider the integral 
pü k. œ Exp[ik. 
I= pf sel E refe f a dx (2.55) 


o A -x co aA -xX 


where a and k are positive real numbers. By replacing Cos with Exp we are able to close 
the contour in the upper half-plane and use Jordan’s lemma to discard the contribution 
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Figure 2.5. Asymmetric behavior of the divergent parts of an integrand with a simple pole. 


—a a 


Figure 2.6. Kilroy contour: indentations that exclude poles at +a resemble the eyes of someone 
peeking over a fence. 


of the great semicircle. The singularities at x = +a are avoided using small semicircular 
indentations, as sketched in Fig. 2.6. (Those familiar with World War II iconography might 
call this the Kilroy contour.) 

The contribution from the segments along the real axis sum to the principal value, while 
the indentations contribute —iz times the sum of the two residues because the semicircles 
are traversed in a negative sense. Thus, 


E ik co E ik ika —ika 
o= p a P dm +S ) (2.56) 
C = a 


a -z o AX 


gives 


© Bxplik © Cos[k 
P f 2da dx = = Sin[ka] = P f Social dx = = Sin[ka] (2.57) 
o AX a oo A a 
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Notice that the principal value is the same whether we choose indentations to exclude 
or to include either or both poles (four possibilities) — check this for yourself! Thus, the 
contribution to a principal value integral made by a simple pole on the contour is half the 
value it would have made if enclosed. 


2.5 Integration Around a Branch Point 


If the integrand involves a multivalued function, care must be taken with any branch cuts 

in the integration region. Such functions often differ in phase on opposite sides of the cut, 

so that contours with segments on opposite sides will include contributions of equal mag- 

nitude but different phase. Often the presence of a branch cut actually assists in evaluation 

of an integral — branch cuts are not always monsters to be feared! While it is difficult to 

formulate general rules, a couple of examples should suffice to illustrate the method. 
Suppose that we wish to evaluate the integral 


a yf 
i ph with —1<a<0, b>0 (2.58) 
0 x 


using contour integration. The function 


„a 
4 


zt+b 


fi] = 


(2.59) 


has a pole at z = —b and requires a branch cut to define the phase of the numerator when a 
is nonintegral. This branch cut will connect singularities at z = 0 and z > oo, but we have 
some freedom in choosing its orientation. Although one usually cuts such a function along 
the negative real axis, for this purpose it is more convenient to put the cut on the positive 
real axis so that the desired integral is found on one segment of the contour sketched in 
Fig. 2.7. For now we assume that b > 0 so that the pole is on the negative real axis. Thus, 
for this problem we will cut just below the positive real axis and define the principal branch 
as 


z“ = |Z“ Exp[iaarg[z]] with 0 < arg[z] < 27 (2.60) 


by restricting the phase of z to the range (0, 27r). The residue of the pole is then b* Exp[iza] 
such that 


VHG dz = 2nib* Expliza] (2.61) 
c 


on the positive contour below, which encloses the pole without crossing the branch cut. 
The contribution of the outer circle vanishes in the limit R > oo because 


Iz] > co => |f[z]| Il! (2.62) 


falls more rapidly than |z|! when a < 0. Similarly, the integral on the inner circle with 
radius e vanishes in the limit e > 0. The contributions on either side of the real axis are 
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x! 


Figure 2.7. Contour used to evaluate 18 + 


dx for positive real b. 


both proportional to the desired integral but differ in phase. On the top side of the cut 
z“ => x“ while on the bottom side z“ > x“ Exp[2rr1a], such that 
xt 


x+b 


(1 — Exp[2zia]) f dx = 2nib* Exp[iza] (2.63) 
0 


Therefore, we obtain the integral 


00 xt bir 
— dx === —= f -l<a<0, b>0 2.64 
f x+b* Sin[xa] > j a 
This result can be generalized by recognizing that we need only require —1 < Re[a] <0 
to ensure that the contribution of the outer circle vanishes as its radius becomes infinite. 
The definition of b° = Exp[aLog[b]] = Exp[a(Log[lb|] + ¿Arg[b])] can be extended to 
complex powers. Therefore, we obtain the more general result 


f > dx = en for —1<Rela]<0 and Arg[b] +7 (2.65) 
without extra work. However, this result does not apply if Arg[b] = 7, because then there 
would be a pole on the contour. To handle that situation, we employ the contour shown in 
Fig. 2.8 for which the segments on either side of the positive real axis are proportional to 
the principal value of the desired integral. 

This contour encloses no poles, so that 


p<0= q) Z de=0 (2.66) 
cz+b 


The integral around the great circle vanishes, as before, but we now have two small semi- 
circular indentations to evaluate. Using z + b = ee? => dz = ice" dé, both contributions 
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y 


A 
> 


xt 


Figure 2.8. Contour used to evaluate Le 5 


dx for negative real b. 


take the form 


% (—b + se") 


lim D 


ise”? de = (by ete? (0, = 9) = i(-b)“e"? A0 (2.67) 
e>0 0 ee 


where on the upper semicircle ø = 0 and 6, = 7,0, = 0 => A@ = -rz while on the lower 
semicircle we must choose ¢ = 27 and 6, = 27, 0, = 1 => A@ = -rz for the selected 
branch of z“. The integrals along the real axis on either side of the branch cut are both 
principal-value integrals with different phase factors because on the upper side z* = x“ 
while on the lower side z“ = x“e?"". Thus, we obtain 


2ria 2 gf ) . a 2nia 
(l-e Jef ap A = i) (1 +24) (2.68) 
and conclude 
¡E 
b<0= pf — dx = —(—b)*nCot[za] (2.69) 
0 X+ b 


Notice that the cut does not diminish the influence of the pole — choosing the cut to run 
over the pole does not mask it; we could have placed the cut somewhere else if that had 
been more convenient. 


2.6 Reduction to Tabulated Integrals 


Some integrals that cannot be evaluated in terms of elementary functions occur sufficiently 
frequently to merit naming as special functions. Among the most useful are the gamma 
function 


T] = E rletdt, x>0 (2.70) 
0 
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and its cousins the incomplete gamma function 


T[x, a] a| leidt, x>0, az0 (2.71) 
and the beta function 
Tr] : 
pma- f a —1)"! dt (2.72) 
T[r + s] 0 
Also important are the sine and cosine integrals 
* dt > dt 
Si[x] = f A Sin[t]  Cilx] = - f = Cos[t] (2.73) 
0 * 
and the exponential integrals 
” dt * dt 
E, [x] = { me” Ei[x] = P f rd (2.74) 
1 00 
The error function Erf[x] and complementary error function Erfc[x] = 1 — Erf[x] are 
defined by 
Erf[x] a f dte” Erfc[x] a ll "dret (2.75) 
x] = — x] = — . 
yr 0 Vn x 
while their trigonometric cousins are the Fresnel sine and cosine functions 
x t2 Xx t2 
Six] = ‘i dt sin Z| Cix] = f dt Cos 5 (2.76) 
0 2 0 2 


Once an integral has been reduced to one of these special functions, it can be considered 
done because these functions have been studied and tabulated extensively and are also 
available in many mathematical software packages. 
4 
2.6.1 Example: |” e" dx 
—00 


The integral 


I= i: e* dx =2 f e* dx (2.77) 
—oo 0 


can be expressed in terms of a gamma function using the variable transformation 


1 
y= dy = 4x? dx = dx = y dy (2.78) 
whereby 
EE a 1 pl 
I=- e? dy = lal 2.7 
o f yege H (2.79) 


is obtained directly. 
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2.6.2 Example: The Beta Function 
A surprisingly wide variety of integrals can be expressed in terms of the beta function 


T[p1Plg] 


Bip, ql = Tp +q] 


(2.80) 
where p, q are generally complex. For the present purposes it will be sufficient to assume 


that p, q are nonnegative real numbers so that we can employ the simplest integral repre- 
sentation 


T[p] = f e “uP! du (2.81) 
0 
for the more familiar gamma function. The substitution u > x? provides 
T[p] = f ew du=2 f eo 2?! dx (2.82) 
0 0 
such that 
riria = 4 f Í eLA- ay dy 
7/2 (2.83) 
= af foe P Cos[o}??-! Sinj]! dr do 


can be represented as integral over the first quadrant in polar coordinates. The radial inte- 
gral can be performed in terms of the gamma function 


oa 1 
T e” pop +2g+l dr= al ip + ql (2.84) 
0 
such that 
f 7 Cos[6]??"! Sin[a]?7! dé = Tietia _ -lgi ] (2.85) 
6 Mp 2... 


is expressed in terms of the beta function. The left-hand side would probably appear for- 
midable to the uninitiated! We will find both gamma and beta functions very useful as the 
course progresses. Several additional variations are developed in the exercises. 


2.6.3 Example: m dw 


æ 1 
Integrals of this form appear in the statistical mechanics of Bose systems. Here we assume 
that œ > 0 and that n is a nonnegative integer. The integrand can be expanded as a power 
series in the small quantity e~®”, such that 


= g" poo ñ 
| ef Too dw = ye w" Exp[-mfw] dw (2.86) 


m=1 
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We evaluated the integrals on the right-hand side using parametric differentiation at the 
beginning of this chapter. Using that result, we write 


E 1 ! 
[xs ee = 1 dw = a 2, m”+! = gato + 1] (2.87) 


m= 


where the Riemann zeta function is defined by 


da=) (2.88) 
“Am 
for Re[z] > 1. Special values for integer arguments can be expressed in terms of the 
Bernoulli numbers and evaluated in closed form, but for our purposes we can consider the 
problem solved because the properties of the Riemann zeta function are well established 
and one can find numerical values in standard tables or mathematical software. Alterna- 
tively, the series can be evaluated numerically in a straightforward manner, if necessary. 
More generally, we combine the same expansion with variable transformations to write 


oo we 
——— du = ea f -dt =p! IN t° Exp[—mt] dt 
| w f $ 2 p 
=p! "i Exp[-s]ds X m 
B f p > 


m=1 


(2.89) 


Thus, we identify 
Re[a] > 0, Re[£] > 0 = [zs wea po 'Tla + 1][a + 1] (2.90) 


with much less restrictive conditions upon the parameters. 


2.7 Integral Representations for Analytic Functions 


Special functions motivated by integrals for real variables can usually be extended into a 
portion of the complex plane simply by replacing the argument by a complex variable and 
employing a contour through the domain of analyticity of the integrand. Thus, one can 
define the complex sine integral by 


Si[z] = f Í A Sin[t] (2.91) 
0 


where the contour is any path between the indicated endpoints in the complex t-plane. 
This case is particularly simple because the integrand is entire — there is no danger of 
encountering singularities or branch cuts during integration. Similarly, the error function 
can be extended to the entire complex plane simply by using a complex argument 


Erf[z] = —= ES dte”  Erfelz T dte” (2.92) 
“yn y 
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. Re[r] 


Figure 2.9. Contours for Ci[z]. 


where an upper limit of oo is interpreted as a path that asymptotically approaches the 
positive real axis such that t > (R, 0) where R > oo. It is also useful to define the imaginary 
error function Erfi[z] = —i Erf[iz], such that 
an ae 
Erfilz] = —= dte (2.93) 
va Jo 
Conversely, if the integrand is multivalued or has singularities, the definition of a func- 


tion by means of an integral representation must constrain the contour enough to produce 
a unique value for the integral. For example, the integrand of the complex cosine integral 


. * dt 
Cilz] = -f F Cos[żź] (2.94) 


has a simple pole at the origin, with unit residue. Suppose that z is found in the second 
quadrant, as shown in Fig. 2.9. The integral along contour C,, which dips below the real 
axis for Re[t] < O and then approaches the positive real axis from below, differs by 2711 
from the integral for contour C,, which remains in the upper half-plane and approaches 
the positive real axis from above. (Imagine closing the contour for large Re[t] where the 
integrand is vanishingly small.) Contours which circle the origin several times differ by 
multiples of 271, representing various branches of a multivalued function. The princi- 
pal branch for this function is defined by the requirement that the positive real axis is 
approached from above without encircling the origin; this requirement is sufficient to pro- 
vide a single-valued definition while still leaving considerable flexibility in the choice of 
contour. 

The domain of analyticity will often be limited by requirements for the convergence of 
the defining integral. For example, the present definition for I'[x] is limited to x > 0, sug- 
gesting that straightforward extension to the complex plane would be limited to Re[z] > 0. 
However, we must still be wary of the branch cut needed to establish a unique value for 
the integrand when z is not an integer. Thus, for Re[z] > 0 we could use 


T[z] = f le” dt, Relz]>0 (2.95) 
0 
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with the contour in the left side of Fig. 2.10, where the positive real axis is approached from 
above. A more general definition can be made by using a theorem of analytic continuation 
that we will derive and discuss more thoroughly in a later chapter, which states that if 
the functions f,[z] and f,[z] are analytic in domains D, and D, and if filz] = falz] for 
all z € D, N D,, then f, and f, are representations of the same analytic function f[z] 
within their respective domains and f[z] is analytic throughout D, U D,. Therefore, if we 
can design an integral representation that provides identical values for Re[z] > 0 while 
avoiding the singularity in the integrand, we would be able to extend the definition of T [z] 
to the entire complex plane. Consider the function 


fiz = { fe" dt = if Exp[-+ + (z— 1)Log[r]] dt (2.96) 
C E 


for the inner keyhole contour around a branch cut on the positive real axis that is shown on 
the right side of Fig. 2.10 and is navigated in a counterclockwise sense. The small circle 
about the origin does not contribute when Re[z] > 0. The contributions from either side of 
the branch cut differ in phase according to 


t = x + le = Log[t] = Log[x] (2.97) 
t =x — le = Log[t] = Log[x] + 27 (2.98) 
such that 
fiz] = (Exp[2ziz] — 1) Í le~” dt (2.99) 
0 


Thus, we obtain a definition of the gamma function 


= 1 a 
Tz] = dal Í te" dt (2.100) 


that can be used for complex variables with positive real parts. The keyhole contour can 
now be deformed into the outer contour in the same figure without encountering any sin- 
gularities and without altering the value of the integral. We simply require that C enters 
from +00 just above the real axis and exits toward +co just below the real axis without 
crossing the positive real axis. Therefore, the proposed integral representation extends the 
definition of the gamma function to the entire complex plane. When Re[z] < 0, one simply 
avoids the immediate vicinity of the origin. 

It might appear that this integral representation for I'[z] has singularities for any inte- 
ger value of z, but the singularities for positive integers are illusory (removable). When 
z = nis an integer, 1”! does not require a branch cut such that the contributions from the 
segments of the keyhole contour above and below the real axis cancel, leaving only the 
small circle about the origin. Alternatively, in the absence of a branch cut the contour can 
be deformed into a closed circle about the origin. (We imagine that the original contour is 
closed across the real axis so far out that the integrand is vanishingly small.) When n > 0 
the integrand is analytic and the integral vanishes, leaving a 0/0 situation that suggests a 
removable singularity whose value should be determined by a limiting process that ensures 
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Figure 2.10. Left: contour for P[z] with Re[z] > 0. Right: more general contours for T [z]. 


continuity. Although we expect that the appropriate value is [[n] = (n—1)!, it is instructive 
to demonstrate that this result emerges from a suitable limiting process. Performing Taylor 
series expansions around z = n, 


t! x t"|Log[t}(c—n), Exp[2aiz] — 1 ~ 2ni(z — n) (2.101) 
we find 
1 
zan = Tld = ia f t" eLog[t] dt (2.102) 
E 


and can employ the keyhole contour for n > 0 without obtaining any contribution from the 
small circle around the origin because 


n > 0 => lim e"Log[e] =0 (2.103) 


Placing the branch cut for Log[t] immediately below the positive real axis, the contribu- 
tions from Log[It|] on opposite sides cancel, leaving the contribution from the phase of 
Log[t — ie] = Logllt]] + 277 to give 

1 


Tiz] = — 
lz 2ni 


f > tle" (2ni) dt = (n— 1)! (2.104) 
0 


as expected. When n < 0, we use 


a ni 
$ de = (7 (2.105) 


n! 


for a circle about the origin to discover that the residue for a simple pole at z = —n is 
(—)"/n!. Therefore, the integral representation teaches us that I'[z] is a meromorphic func- 
tion with simple poles at negative integers and determines its residues. It should be clear 
that the new integral representation offers much more detailed information than the origi- 
nal definition on the real axis. 

Many special functions have several different integral representations, with one or 
another being most useful under differing circumstances or for various purposes. These 
integral representations often provide the simplest or most general derivations for the prop- 
erties of a function. For example, in the chapter on Asymptotic Series we will use integral 
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representations to study the properties of several useful functions when |z| is large. There 
we take advantage of the flexibility of the contour to concentrate upon a segment which 
dominates the integral. However, one must be careful when deforming contours of infi- 
nite length because the Cauchy—Goursat theorem on the path-independence of integrals 
of analytic functions was derived for finite contours. When one or both of the endpoints 
is at infinity, it can matter whether the approach toward infinity is along the real axis, the 
imaginary axis, or some intermediate direction. Penalty-free movement of a “free end” 
requires that the contribution of an arc of radius R subtending the angle through which the 
free end is moved must vanish as R > oo. Although this requirement should be obvious by 
now, from the care with which we studied the return paths for closing a contour of infinite 
length, it still bears some repetition. The contours used for integral representations offer 
considerable but not unlimited flexibility. 


2.8 Using MATHEMATICA to Evaluate Integrals 


2.8.1 Symbolic Integration 


Integrals can be entered in typeset form using the BasicInput palette, but if you wish to 
modify any options you will need to use the command line. The basic syntax for an indef- 
inite integral is 


Integrate[f[x], x] 


Indefinite integrals like 


ansi -f a* dx 


-x Gamma[1 +n, -x Logla]](-x Logla])** 

are returned without constants of integration and can be checked easily by differentiation 

D[ans1, x] / / Simplify 

ax” 

The integral above is expressed in terms of the incomplete gamma function. 
MATHEMATICA makes no a priori assumptions about the variable of integration or the con- 
stants, so that the result applies to either real or complex variables and parameters. How- 
ever, it also makes no assumption regarding the values of parameters, returning results for 


generic values that are not always valid for special values of the parameters. Thus, the 
simple integral 


Ea dx 


xita 


lta 
is not valid if a happens to have the value —1. 
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The basic syntax for a definite integral is 
Integrate[f[x], tx, a, b}] 


The limits of integration may be numerical, symbolic, or infinite. In early versions any 
symbolic parameters or limits would also be interpreted in a generic sense. Thus, one 
would obtain a result for 


b n+1 n+l 
b+ -a 
n dx = ———— 2.106 
{ ae n+1 ( ) 
that would valid for most choices of parameters, but fails for n = —1, while 
b 
T x! dx = Log[b] — Log[a] (2.107) 


would fail for negative limits of integration. In more recent versions MATHEMATICA usually 
returns conditional results 


1 
Í x" dx 
o 


pl 
TÉ [Reta >-1, cca! Integrate ES {x, 0, 1), Assumptions > Re[n] < -1]] 
n 


b 
Í x" dx 
a 


coord eo 
a O Integrate [Ca + (-a+b)x)”, {x, 0, 1}, 


E E 


a-b -a+b 
b 
1 x“ dx 
a 
a a 


(-a+b) If [e| =] > 1 |[re[ 7] > o| 1m| 3] #0, 
a-b 


Int t | s £x, 0, 1} 
, Integrate | ———————, {x, 0, 1}, 
En a+ (-a+b)x 


A : a la f_2 | a , 
ssumptions >! |Re 21 pe > || Im #0 ; 
a-b -atb -atb 


that test any parameters that appear either in the integrand or the limits of integration. 
Although such results are less attractive and the conditions are often expressed in unnec- 
essarily complicated forms, there is less chance of obtaining incorrect answers due to 
careless unstated assumptions about the parameters. Although one can use the option Gen- 
erateConditions > False 


Integrate [x”, {x, a, b}, GenerateConditions > False] 


-qitn + pit 


1+n 
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to obtain a simple expression instead of a conditional statement, we strongly discourage 
that reckless practice and recommend instead that one supply the assumptions that apply 
to the parameters in your integrand directly, as follows. 


Integrate E, {x, a, b}, Assumptions > {Re[n] > -1,0<a< b}] 


-a12 + pit 

ttü 

MATHEMATICA claims to be able to produce practically any integral in standard compi- 
lations, such as the massive compilation by Gradshteyn and Ryzhik. Thus, one finds that 
many seemingly unpromising integrals can be evaluated in terms of recognizable functions 
even if the output appears complicated. However, despite its impressive versatility, there 
remains a nontrivial error rate in the symbolic integration package. We decline to present 
specific examples here because each revision of the program seems to correct some errors 
while introducing new ones. Nevertheless, investigation of most (but not all) disagree- 
ments you find with MATHEMATICA’ will eventually show that you have made a mistake. 
The software is good, but not perfect! Therefore, it remains useful to be able to perform 
such integrals independently. Furthermore, any result that is important should be checked. 
A very useful method for checking a symbolic integral is to compare with numerical eval- 
uation for representative choices of the parameters. This technique cannot prove that a 
result is valid for all parameters satisfying the requisite conditions, but it will sometimes 
find errors. 


Y Moral: trust but verify! 


2.8.2 Numerical Integration 


The basic syntax for numerical integration is 
NIntegrate[f[x], tx, a, b}] 


where the limits must be numerical and the integrand must evaluate to a number when 
given a numerical value for x. Numerical integration is generally more reliable than sym- 
bolic integration. Symbolic integration requires a vast library of pattern-matching rules for 
which it is difficult to ensure that all special cases are handled properly, while numeri- 
cal integration is much more mechanical. The basic technique for numerical integration is 
to sample the integrand at strategically located positions, construct an interpolating poly- 
nomial, and then integrate the polynomial. The accuracy of the integral can be tested by 
subdividing the interval and applying the method to smaller parts. One can also sample 
more points where the integrand changes most rapidly. Many reliable algorithms have 
been developed and the one used by MATHEMATICA is quite good. If it does encounter 
trouble with a particular integrand, there are options that can often be used to overcome 
those difficulties. If the difficulties persist, one should examine the integrand and handle 
its singularities more carefully. 
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2.8.3 Further Information 


More information about integration using MATHEMATICA’, including multiple integrals and 
symbolic and numerical contour integration, can be found in calculus.nb at my Essential- 
Mathematica website. 


Problems for Chapter 2 


You may use MATHEMATICA to check your work, but do not trust its symbolic integration 
too much. When evaluating integrals, you must specify the contour, convincingly justify 
neglect of any vanishing portions, and define phases near any branch cuts carefully. Be 
sure to consider special cases and be alert to possible generalizations. 


1. Some related trigonometric integrals 
Evaluate 


{ "dé for a>b>0 (2.108) 
o a+bCos[d] j 


using contour integration and then deduce 


2r 2r 
{ E — 4a f e a Ed (2.109) 
o (a+bCos[8]) o (a+bCos[6]) 


by more elementary means. 


2. Trigonometric integrals on unit circle 


Sino 
do f -1 1 2.11 
UN Dra 0 for <a< ( 0) 
271 
b) f Exp[Cos[0]] Cos[n6 — Sin[0]] d0 for integer n (2.111) 
0 
c) f Sin[0]?" 49 for nonnegative integer n (2.112) 
0 
a ae 
d) f EEIT] for arbitrary complex a, b (2.113) 


3. Magnetic flux through circle from coplanar wire 

An infinitely long wire carries current J. Compute the magnetic flux through a coplanar 
circle of radius a that is at a distance d from the wire, where d > a. Compare the limits 
d>aandd za. 
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4. Average power radiated by charged harmonic oscillator 
The angular distribution for the power radiated by a charge in simple harmonic motion 
with amplitude a and frequency w is given by 


Cos[wt |? 


= KSin[0]? - 
(1 + 6 Cos[6] Sin[wr]) 


P 
— 2.114 
40 ( ) 
where $ = aw/c is the amplitude for the velocity oscillation (relative to light speed) and 
K = e&a’w*/4nc’. Evaluate the angular distribution of radiated power averaged over a 
period. Finally, evaluate the total average power integrated over angles. 


5. Assorted integrals on infinite range 
Use contour integration to evaluate the following integrals. 


¡| 
a) f —dx where n> lis a positive integer. (2.115) 
o L+x" 


(Hint: try an arc subtending 27/n radians.) 


» f cosp costel dx with p,q real (2.116) 
© Cos[k 

c) P f cosl a dx with k real (2.117) 
zw b 

d) f Costar) ax for real a, b [bl > 0 (2.118) 
0 (x? + 1?) 

of. eB) dx for real a, b, Ibl >0 (2.119) 
(x +b’) 

in[20 
of. 2L dx for0<0<2n (2.120) 
(1 + 12) (1 — 2x Cos[6] + x°) 


> d: 
g) f — E with k real and nonzero (2.121) 
o Cosh[kx] 


6. An integral from diffraction theory 
Evaluate 


oa: 2 
f ah a dx (2.122) 


X 
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7. Integrals used in Fourier transform of oscillator wave functions 
a) Evaluate 


{ e“ Cos[bx] dx (2.123) 
0 


for a, b > 0. 


b) Produce a general method for evaluating integrals of the form 
T eo 2"! Sin[bx] dx (2.124) 
0 
with a, b > 0 and integer n = 0. Display explicit results for n = 0, 1. 
Integrals of this type arise in the Fourier transform of oscillator wave functions. 
8. Integration around branch cuts 


a) Evaluate 


l dx . 
witha > b>0 (2.125) 


1 (a+ bx)V1 -x 


using a suitable contour. (Hint: put the branch cut on the integration interval and define 
phases carefully.) 


b) Evaluate 


o] 
f ax dx (2.126) 
0 


using a suitable branch of Log[z]. 


c) Evaluate 


e (nx)? 


0 1+x 


(2.127) 


using a suitable branch of Log[z]. 
d) Evaluate 
1 1/4] — 43/4 
n aai o A dx (2.128) 
0 a +x) 

using a suitable contour. (Hint: put the branch cut on the integration interval and define 
phases carefully.) 
e) Evaluate 


f A (2.129) 
0 


1+ 4 


for complex a and specify the conditions required for convergence. 
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f) Evaluate 


i, — dx (2.130) 
0 


for complex a and integer n and specify the conditions required for convergence. 


g) Evaluate 


1 
f x* "(1 =x)“ dx (2.131) 
0 


for 0 < Re[a] < 1. (Hint: try a large circular contour and then deform the contour to snugly 
embrace the branch cut.) 


h) Evaluate 


00 xt 
al dx (2.132) 
0 x-a 
for real a > 0 and real0 <A < 1. 


©% 1⁄4 
iP d 2.133 
i) f E 3% ( ) 


9. Using S glx] dx > b. Log[z]g[z] dz for meromorphic integrands 
Consider an integral of the form 


f > glx] dx (2.134) 
0 


where g[x] is well-behaved on the positive real axis but is not symmetric with respect to 
the sign of x, such that the integration interval cannot be extended for use with a great 
semicircle. If the corresponding g[z] is a meromorphic function that decreases sufficiently 
rapidly for z > co and is sufficiently small at the origin, one can often use f[z] = g[z]Log[z] 
with a branch cut on the positive real axis and a PacMan contour of the form shown in 
Fig. 2.11. Apply this method for the following integrals. 


e dx 
o f (x + 1)(x + 2x + 2) (2.135) 


> q 
» f ae (2.136) 
o Xx +1 


10. Reduction to standard functions 
Express the following integrals in terms of standard functions. 


a) il x"e* dx fon>0 (2.137) 
0 
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y 


Figure 2.11. Contour used for 9 glx] dx > $. Loglzlelz] dz. 


b) f e" Sin[bx|dx witha, b > 0 (2.138) 
0 
LY de withe>1 (2.139) 
o (e -1y 


11. Using the beta function 

Express each of the following integrals in terms of the beta function and then in terms of 
more familiar gamma functions. Be sure to identify the conditions that must be satisfied 
by the parameters in each integral. 


1 

a) f ay dt (2.140) 
0 
1 

b) | w'(1-w) du (2.141) 
0 

e a 2.142 

bat dl 
1 

d) f a- +x dx (2.143) 
-1 


> (Sinh[x])* | 


(Cosbi? x (Hint: use Sinh[x]? > u and the result of c.) (2.144) 
0 osn|x 


e) 
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12. Beta probability distribution 
The probability density for the so-called beta distribution is proportional to 


Pix] oc x4-1(1 - x)?! (2.145) 


for 0 < x < 1 where a, b are nonnegative constants. Evaluate the normalization, mean, and 
variance for this distribution. 


13. oscillation period in potential V = kx?"/2 

A classical particle with mass m and total energy E oscillates in a potential of the form 
V = kx?" /2 where n is a positive integer. Determine its oscillation period. Can you gener- 
alize this result? 


3 Asymptotic Series 


Abstract. Asymptotic series describe the behavior of a function for large values of 
an argument. The leading asymptotic behavior can often be deduced from an integral 
representation using the method of steepest descent. Repeated partial integration or 
expansion of the integrand provide series that improve upon the leading approxima- 
tion. 


3.1 Introduction 


Often we must characterize the behavior of a function of z when |z| becomes very large. 
For example, in scattering theory we require the asymptotic behavior of Bessel functions. 
An asymptotic expansion for f[z] may take the form 


fiz] = plzlelz] (3.1) 


where ¢[z] represents the leading asymptotic behavior in terms of familiar functions, often 
exponential or logarithmic, while the asymptotic series 


00 


giz] = Y az* (3.2) 
k=0 
represents a correction factor expanded in inverse powers of z. Ordinarily we normalize 
the series such that a, = 1. However, in the common situation that the ratio f[z]/y[z] has 
an essential singularity at z > oo, the series for g[z] diverges. Nevertheless, for any finite 
value of z there may be a truncated series 


n 


g,kl= > ag* (3.3) 
k=0 
which provides a very good approximation to f[z]/y[z] with an accuracy that improves 
as |z| increases. The sequence of analytic functions f,[z] = ¢[zlg,,[z] is then said to con- 
verge asymptotically to fz]. More formally, the function f, is described as asymptotically 
equal to f[z] if 


jim a] — AED = 0 => fale] = fle] (3.4) 


such that their difference becomes arbitrarily small if |z| is sufficiently large. This relation- 
ship is represented by the operator ~ or sometimes ~. A series is asymptotically convergent 
if 

lim a(Z i 


LN g,lz1) 20 edit (3.5) 
plz] 
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for any positive n. This relationship requires that the difference between f, and f can be 
made arbitrarily small if |z| is sufficiently large for any choice of n. Of course, the smaller n 
is, the larger |z| would have to be to achieve a specified accuracy. 

Unlike more familiar convergent series, z”g,,[z] usually diverges for fixed z as n increas- 
es such that 


lim 2"g, [2] > oo (3.6) 


Thus, the accuracy of f [z] as an approximation to f[z] actually deteriorates if too many 
terms are included in the series. Given that the error in summing a Series is smaller than the 
first neglected term, the best approximation to f[z] for finite z is then obtained by truncat- 
ing the series at the smallest term (or, perhaps, the one before). Occasionally f,[z] is actu- 
ally convergent with respect to n even though there remains a slight difference from f[z] for 
finite z that decreases as z increases. We consider a series of this type also to be an asymp- 
totic approximation to f even though some authors limit that term to divergent series only. 
If f,[z] and g, [z] are asymptotic series for f[z] and g[z], one can show that 


f, [2] + galz] = fiz] + glz] (3.7) 
$, [21 x galz] = fiz] x glz] (3.8) 


Asymptotic series may integrated, but there is no guarantee that f’[z] is asymptotically 
equal to f’[z]. Finally, the asymptotic series for a given function is unique, but the same 
asymptotic series can represent more than one function in an asymptotic sense. For exam- 
ple, the asymptotic series for f[z] + e™ for Re[z] > 0 is the same as that for f[z]. 

In this chapter we explore a few of the methods that can be used to obtain asymp- 
totic approximations to some of the functions relevant to theoretical physics. We start 
with saddle-point methods because they often provide the simplest methods for obtaining 
the leading asymptotic behavior. In particular, we discuss the method of steepest descent 
in some detail because it is the most versatile while the closely related but more lim- 
ited method of stationary phase is developed in the problems at the end of the chapter. 
These methods are often suitable for analyzing integral representations that are obtained 
as approximations to the physical behavior of a system under specific conditions, whereas 
some of the other methods discussed later in the chapter are more suitable for analysis of 
mathematical expressions that are, in principle, exact. 


3.2 Method of Steepest Descent 


Often one encounters functions of the form 
n= | Feza (3.9) 
C 


where C is a specified contour in the complex f-plane and F is analytic in a domain includ- 
ing C. We assume that, for the relevant choices of the parameter z, the contribution made by 
the immediate vicinities of the endpoints are negligible and seek an approximation scheme 
that takes advantage of the topography of analytic functions. 
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Figure 3.1. Typical saddle with path of steepest descent. 


Suppose that the integrand can be represented in the exponential form 


F[t, z] ~ Explzelt, z]] (3.10) 


where g[f, z] is an analytic function of the complex variable t. It is useful to express z = re’? 


in polar form and to absorb the phase into that analytic function by writing 
z = re? => zelt, z] = r(u[t] + iv[t]) => Flt, z] = Exp[ru[t]] Explirv[t]] (3.11) 


where the real functions u[x, y] and v[x, y] are harmonic and £ = x+ iy. Recall that neither u 
nor v can have an extremum, so that any point tọ where g’[f)] == O must represent a simul- 
taneous saddle point of both u[x, y] and v[x, y], and that level curves of these two functions 
are orthogonal to each other. Therefore, v is constant on the path where u changes most 
rapidly. Figure 3.1 illustrates these relationships. The point fọ is a saddle point for both 
u and v. The path of steepest descent for u is a path of constant v. By integrating along 
a path of constant v, we avoid the delicate cancellations in the integrand due to the rapid 
oscillation of Expl[irv] for large r. By selecting the path of steepest descent in u, where 
u can be parametrized in the form u = ug — as? with positive œ, we ensure rapid con- 
vergence of the integral of j ent dt o f e7% ds. Thus, the method of steepest descent 
can be used whenever it is possible to deform the contour C so that it passes through a 
saddle point along this special path. The contribution made by the immediate vicinity of 
the saddle point will often provide a very good approximation when r is sufficiently large 
and thus represents the lowest term of an asymptotic expansion. The method can also be 
generalized for application to more complicated surfaces featuring several saddle points 
between the endpoints of the contour. One simply applies this analysis to each saddle point 
and adds their contributions. 
Near the saddle point we can expand 


t-i? 
2 + 


alt] = glito] +2” [to] (3.12) 


in parabolic form. It is useful to define 


z=re®, t—t)=se*, 2" [t)] = ge” (3.13) 
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where r and g, are positive real numbers representing the magnitude of |z| and the curvature 
of u at the saddle point, respectively. The angle y represents the phase of g”[f] near the 
saddle point. A straight path through the saddle point is represented by a constant value 
of ¢, describing the orientation of the path, and a real number s varying continuously from 
negative to positive values measuring the distance along the path relative to tọ. Thus, the 
argument of the exponential on a path through the saddle point is parametrized by 


2 
zglt] = 28, + Se (3.14) 


We are free to choose ¢ in a manner that optimizes our approximation to the contour 
integral. The path of steepest descent is selected by choosing ¢ according to 


n—0-y 


0+y+2¢ =r => = z” dt = se” ds (3.15) 
such that 
zglt] = zgo — as? (3.16) 
where 
821 
= 2 Hi7 
we (3.17) 


is large and positive for large r, provided that g, is not abnormally small (if it is small, we 
may need to carry another term). The integrand is then so sharply peaked around s = 0 
that we can extend the limits of integration to +00 with negligible error and it is easy to 
evaluate the resulting Gaussian integral. Hence, we obtain 


fiz] = Explzgo + id] i Exp[-as?] ds = Exp[zgy + id] iz 


(3.18) 
= Explzgy + id], | = 
82F 
Therefore, 
. 27 
fiz] = Explzgo + 161, | —— (3.19) 


golzl 


provides the first term in an asymptotic expansion for f[z] applicable for large |z]. 

To apply the method we must express the integrand in exponential form and select an 
appropriate saddle point of the argument of that exponential through which the contour 
can be deformed in a manner that ensures that the integral is dominated by the immediate 
vicinity of the saddle point. By identifying the proper contour phase, the leading asymp- 
totic behavior can be obtained quite easily. Often one can also develop a systematic series 
of corrections, thereby obtaining an asymptotic expansion in negative powers of z. 
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Alt], Exp[glel] 


t 


Figure 3.2. Integration of slow x fast functions. 


The method can be generalized to functions expressible in the form 


ZE f hit] Explzelt. zl] dt (3.20) 
iC. 


where h[t] varies much more slowly on the integration path than does Explzglt, z]]. Fig- 
ure 3.2 illustrates that if h[t] changes little over the width of the Gaussian approximation 
to the exponential factor near the saddle point of gl, z], then A[t] ~ h[f)] can be extracted 
from the integral such that 


fiz] = hito] Exp[zg, + id] call (3.21) 
galzl 
simply has an additional factor. 

Approximation is as much art as science. The method of steepest descent provides 
one strategy for obtaining a useful approximation to a particular type of function, but 
its application depends upon the nature of the particular problem. Rather than memorize 
Eq. (3.21), with its definitions of (fo, go, 82 ¢}, it is better to understand how the strategy 
works and then, if applicable, adapt it to the problem at hand. 


3.2.1 Example: Gamma Function 
The gamma function for Re[z] > 0 can be defined in terms of the integral 
T[z+1]= f Be” dt = f Exp[z Log[t] — t] dt = { Exp[zg[t]] dt (3.22) 
0 0 0 


where z = re'% and where 


alt] = Logit] - : (3.23) 
1 
r 


gi]=-- (3.24) 


XN |e 


1 
gI = “2 (3.25) 
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is an analytic function of t with a parametric dependence on z. The saddle point is defined 
by 


[ph] =O > =z (3.26) 
such that 
1 e 20 
gito] = Loglz]-1,  g”[t] = a (3.27) 


Thus, we identify 


1 
82 = > Y= nm —20 (3.28) 
r 
and solve 
0 
COPY TARA a oe (3.29) 


to determine that the path of steepest descent is along ¢ = 6/2. Substituting these values, 
we obtain 


0 
Tiz + 1] = Exp] z(Log[z] — 1) + Sy 2ar = V2nz Ze * (3.30) 
Finally, recognizing that P[z + 1] = zl [z], we obtain an approximation 
Liz] = V21 226% (3.31) 


that is useful for large |z| and Re[z] > 0. 

Figure 3.3 illustrates the path of steepest descent for the particular case z = 10(1 + i), 
for which 6 = 1/4 = > @¢ = 7/8. Contours for the real and imaginary parts of zglt, z] 
are indicated by solid and dashed lines, the saddle point is labeled by f,, and the path of 
steepest descent is shown as a thick line. The complete contour in the £ plane begins at 
the origin and is approximated by this line in a region surrounding f, that is large enough 
to accumulate most of the integral, and then approaches the positive real axis, £ > oo + 
ig, from above. In fact, by using the more general integral representation of the gamma 
function, given by 


= 1 z=l jt 
Tiz] = ak e` dt (3:32) 


for the positive contour indicated in Fig. 3.4 (recall Eq. 2.100)), one can demonstrate that 
the same asymptotic approximation also applies when Re[z] < 0. One simply deforms the 
contour to pass through the saddle point tọ = z with the proper slope. This generalization 
is left as an exercise for the reader. 

The accuracy of this approximation is illustrated in Fig. 3.5, which was produced using 
the MATHEMATICA code below. Curves are shown for 0 < 0 < 27 in steps of 7/8. Interest- 
ingly, the approximation remains successful even for rather small r, except in the immedi- 
ate vicinity of 0 = m where the poles in I'[z] are found; this is the curve which stands out. 
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11 


10.5 


10 


9.5 


Figure 3.3. Portion of a contour for the integral representation of P[10(1 + £)] near the saddle point 
for tọ. The heavy curve shows the path of steepest descent while level curves of u and v are shown 
as thin solid and dashed curves, respectively. 


Im[r] 


to Po O 


Re[z] 


Figure 3.4. A contour for Eq. (3.32) that passes through a saddle point t} for Re[z] < 0. 


Sometimes results of this type have broader applicability than suggested by the assump- 
tions employed in their derivation. 
Plot [Evaluate [Table [Abs [1 - Van zie? / Gamma [z]] /.z>r e}, 

{90, 0, 27, =] , Tr, 1, 10}, PlotRange > {Automatic, {0, 0.08}}, 


Frame > True, FrameLabel > {"r", "relative error"}] ; 


3.3 Partial Integration 
Suppose that a function is defined by an integral of the form 
fk] = f gl z]dt or flzl= [ate 2] dt (3.33) 
d 0 


where the independent variable appears in one of the limits of integration. Repeated inte- 
gration by parts can then generate a series in z leaving a remainder term in the form of an 
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0.08 
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0.02 
0.01 


Relative error 


Figure 3.5. Accuracy of asymptotic approximation to T'[re*”]. Curves are shown for 0 < 6 < 27 in 


steps of 7/8; only the curve for O = z is clearly separated for small r. 


integral that is hopefully much smaller in the limit |z| > oo. One example should suffice to 
illustrate the method. 


3.3.1 Example: Complementary Error Function 


The complementary error function 


erfc[x] = T f > e” dt (8.34) 
T Jx 


representing the fraction of the area in the wings of a Gaussian probability distribution is 
important in statistics. It is useful to re-express this function as 


2 p 1 œ dw 
erfc[x] = = | e” dt = — —e" (3.35) 
Vn Jx Vn Je yw 


and then integrate by parts repeatedly to obtain 


1 o 1” ee 
erfc[x] = 7 (Cuei — zS dwe “w dé 
i fe* 1 -3/2,,-w\> _ 3 E -w a 
= —_ A Ez = wW i d wW 2 
zl x Al wees | awe te (3.36) 
1 (e 1le% 3 E oe T a a) 
= = = > W q d w 
=($ E S wew 


The pattern should now be achingly clear. Thus, we write 


n 


erfc[x] = l +> Te | (3.37) 


2k 
E (2x5) 


where n should be chosen to give the best approximation for specified x. 
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Figure 3.6. Convergence for erfc[5]. 


Let 


n 


(2k — 1)! 
S, = 2% Ge ae (3.38) 


represent a sequence of partial sums. For any finite x the sequence eventually diverges 
because the terms a, begin to grow for k > k,,,,. The optimum number of terms is given 
by 


eal 
ak 


a OE Max| 0, Floor[x? — 3]| (3.39) 


2 max 


<15 


X 


where Floor[x] is the largest integer less than or equal to x. Figure 3.6, showing |a,|, demon- 
strates that for x = 5 the first term already provides 2 significant figures while the accuracy 
of the optimum sum is better than 10 significant figures. It is also clear that accuracy is 
lost if too many terms are taken. 

However, Fig. 3.7 plotting terms a, for x = 2, for which knas = 3, suggests that this 
approximation is limited to a few percent accuracy if x is as small as 2. These characteris- 
tics illustrate the basic nature of asymptotic expansions. 

A better approximation is available for series whose terms alternate in sign. Although 
the term a, is slightly too large in the figure above, we expect that S$, is slightly too small 
and we expect that a better approximation will be given by S$} + 344. Thus, it would appear 
that the best asymptotic approximation is obtained by averaging partial sums for k,,,, and 
Kmax + 1 terms, such that 


1 e* 
erfc[x] = —= = 


Vn x 


although we do not have a formal proof of that hypothesis. This technique of optimizing 
an asymptotic approximation is sometimes called Richardson extrapolation. The follow- 
ing MATHEMATICA function evaluates our asymptotic approximation to erfc[x]. Richardson 
extrapolation is used by default, but can be disabled by including the option Extrapola- 


(1 +S, + Las 41) (3.40) 
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Figure 3.7. Convergence for erfc[2]. 


tion > False. 

Clear [erfc]; 

Options [erfc] = {Extrapolate > True}; 

erfc[x_, opts__ Rule] := Module[{kmax, sum, extrapolate}, 


extrapolate = (Extrapolate/ .{opts}) / .Options[er fc]; 


1 
kmax = Floor [x? - 2) ; 


(2Range [kmax] - 1)!! 


sum = I f [kmax > 0, 1 + Plus@@ Cox?) Range Cima) 


2 


(2kmax + 1)!! 1 e* 
2 (-2x2) mand? 01,11; a 
The improvement obtained by extrapolation is illustrated in Fig. 3.8, which was pro- 
duced by the MATHEMATICA’ code below. Extrapolation clearly improves the approxima- 
tion, which is better than 1% for x > 1.3. The approximation is poor for x < 1, but 
improves rapidly. 


If[extrapolate, sum] 


er fc [x] er fc[x, Extrapolate > False] 
Er fc[x]’ Er fc [x] , 
{x, 1,2), PlotRange > A11, PlotStyle > {{}, Dashing[{0.02, 0.02}]}, 


Plot [{1 - 


Frame > True, FrameLabel > {"x", "relative error"), 

PlotLabel > "asymptotic approximation to erfc[x]", 

PlotLegend > {"with extrapolation", "without extrapolation"}, 
LegendPosition > {-0.2, -0.4}, 

LegendSize > {0.7, 0.3}, LegendShadow > None] ; 


3.4 Expansion of an Integrand 


Tf the function is defined in terms of a definite integral 


b 
Flz] = f glt, z] dt (8.41) 
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Figure 3.8. Asymptotic approximation to Erfc[x]. 


where a and b are fixed, while the integrand g[t, z] is amenable to asymptotic expansion 
with respect to z, the expansion for f[z] can be generated by integration term by term. This 
method is most useful when each term is sufficiently localized with respect to ¢ that the 
limits of integration can be extended without appreciable change in the integral. Again, 
one example should suffice to illustrate the method. 


3.4.1 Example: Modified Bessel Function 


Without deriving this integral representation at this time, we can use 


x2 E 7/2 
15 |= qe f Exp[- Sin[+]?] dt (3.42) 


to derive an asymptotic approximation to the modified Bessel function of the first kind with 
order 0, namely /¿[x]. This integral representation is convenient because as x increases the 
integral becomes increasingly dominated by small ft, which permits extension of the upper 
limit to co with little error. Hence, we consider the integral 


m/2 
fix] = | Exp[-x Sin[11?] dt (3.43) 
0 
and use a change of variables 
w = Sin[t] = dw = Cos[t] dt = y 1 — w dt (3.44) 
to write 
1 2? 1 2 4 
e 2.2 w 3w 
fled = [ === dw = awet (i TE] (3.45) 
o Vl-w 0 2 8 


or 


1 a z =I 
flx] = { dwe*“"|1+ A (3.46) 
i Li Fk! 
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If x is large, the Gaussian factor is sharply peaked in w and we can extend the upper limit 
of integration to co with little error such that 


a A (2k- 1)! 
fiz] = f dw e” fi + | (3.47) 
0 k=l á 
The integrals can now be evaluated using 
j —Aw? 1 T 
0 
= 2 ÓN 2n- 1)!! 
Mie f dv ey" = (-F) M= See (3.49) 


and the series becomes 
Fla] = a | + SE = i +i at it) 
S (3.50) 
Unlike the asymptotic expansion for Erfc[x], this series converges for |x| > 1 because 
LED <a 
2(k + 1)x 


and hence is asymptotically equal to f[x] for large x without need for truncation. Thus, we 
obtain an asymptotic series for 


L] e =) 2k- DNY _ 55 
A) x (3.52) 


k 


a 
Id > 1 = 4H 


(3.51) 


ak 


or 


x x (2k — DY 
hix] = a as] (3.53) 
=: ! 


k= 

The following function and accompanying plots explore the accuracy of this asymp- 
totic series for J [z]. Figure 3.9 demonstrates that the expansion becomes increasingly 
accurate as x increases, but even with just the first term of the series one obtains a few 
percent accuracy for x = 1. The series converges quite rapidly for x > 1. Figure 3.10 
shows that the relative error for knas = 5 decreases rapidly as x increases. However, this 
series must still be classified as an asymptotic series, even if convergent, because there is 
a minimum error for finite x that originates from extending the range of integration. 


error [x_, kmax_] := 
kmax 
(2k-1)!! 
(E DF S 2 > (Za Xk! y (2x)” "| Jbesserzto, x] 
Plot [Evaluate [Table [error [x, kmax] , {kmax, 0, 5}]], 
fx, 0,10}, PlotRange > {Automatic, {-0.05, 0.057), 


Frame > True, FrameLabel > {"x", "relative error"), 


PlotLabel > "Accuracy of Asymptotic Expansion for BesselI[0, x]"]; 
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Figure 3.9. Accuracy of asymptotic expansion for J)[x] as k pax increases from 0 to 5. 
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Figure 3.10. Accuracy of asymptotic expansion for /,[x] with kaas = 5. 


LogPlot [Abs [error [x, 5]], {x, 1, 100}, 
Frame > True, FrameLabel > {"x", "relative error"), 


PlotLabel > "Accuracy of Asymptotic Expansion for BesselI[0, x]"]; 


Furthermore, even though our derivation employed real variables, it is clear that the 
series is valid more generally for —5 < Arglz] < 5. However, as |Arg[z]] > 5 the accuracy 
deteriorates for given r. Figure 3.11 shows the r-dependence of the relative error in ire] 
forO <0 < = in steps of ¿ for kmax = 5. Although the error for small r increases with 0, 
this expansion still provides reasonable accuracy for modest k pax over a substantial range 
of polar angles. 


3x xn 


LogP1ot [Evaluate [Table [Abs [error [r e*?, 5]], (0,0, at mall $ 


{r, 1, 100}, Frame > True, 
FrameLabel > {"r", "relative error"}, 


PlotLabel > "Accuracy of Asymptotic Expansion for BesselI[0, x] "] ; 
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Figure 3.11. Accuracy of asymptotic expansion for /, [re] with kaas = 5 and 0 < 0 < = in steps 
of 7. 
8 


Problems for Chapter 3 


1. Landau straggling 

If a fast charged particle passes through a thin film of material, it will lose energy by ion- 
izing atoms via a stochastic process. Landau used a Laplace transform technique to derive 
the probability distribution for this energy-loss process. Without presenting the details, 
suffice it to say that the function 


1 c+ioo 
ias f pe? dp (3.54) 
2111 c—ioo 


plays an important role in the theory. The Laplace parameter c is an arbitrary positive real 
number. Use the method of steepest descent to derive an asymptotic approximation for this 
function that is useful for A « 0. 


2. stellar reaction rate 

A Star is powered by nuclear reactions in its core. Suppose that the average rate of nuclear 
collisions with center-of-mass energy between E and E + dE is Ne-"/"E dE where N is 
a constant and T is the core temperature in energy units. Also suppose that the probability 
that a collision with energy E results in a nuclear reaction is Me” VE where M and a 
are constants. Use the method of steepest descent to determine the reaction rate assuming 
that T « a’. 


3. incomplete gamma function 
The incomplete gamma function, Pla, z], can be defined in terms of the contour integral 


T[a, z] = Í dieta! (3.55) 


where a and z are complex and where the contour in the complex f-plane starts at z and 
approaches t > (+00, 0) smoothly. If necessary, a branch cut is made on the negative real 
t-axis. Use the method of steepest descent to determine the leading asymptotic behavior 
of the function f[z] = F[z + 1, z] when larg[z]| < 7. 


Problems for Chapter 3 109 


4. Airy function 
The Airy functions Ai[z] and Bi[z] are solutions to the differential equation 


w”[z] — zw[z] == 0 (3.56) 


and arise in boundary-value problems in quantum mechanics and electrodynamics. A use- 
ful integral representation of Ai[x] for positive real x is given by 


1 pa w? 
Ai[x] = =f Exp]: (us + 2] dw (3.57) 


Use the method of steepest descent to determine the leading asymptotic behavior of Ai[x] 
for large x. (Hint: choose the appropriate saddle point carefully.) 


5. Bessel function 
The Schläfli integral representation of the Bessel function J,[z] takes the form 


„= i on Exp|(r - -)3] (3.58) 


where the complex t-plane is cut on the negative real axis and where C enters from —oo 
just below the cut, circles the origin once in a positive sense, and then exits toward —oo just 
above the cut but is otherwise arbitrary. Use the method of steepest descent to obtain the 
leading asymptotic behavior, being careful to include both saddle points. 


6. exponential integral 
Develop an asymptotic series for the exponential integral 


E, [z] = Í er" dt (3.59) 
1 


for large |z| with n > 0. You may assume that n is an integer. How many terms can be 
included for finite |z|? 


7. asymptotic expansion of Fresnel functions 
The Fresnel functions for real variables are defined by 


X 2 X 2 
Ch] = f Cos] | at. Slx] = f si] a (3.60) 
0 2 0 2 


a) Determine the limiting values C[oo] and S[oo]. (Hint: use contour integration of an 
exponential on a wedge of opening angle 7/4.) 

b) Obtain asymptotic expansions for the Fresnel functions. (Hint: evaluate corrections to 
the asymptotic values by integrating from x to co by parts.) 


8. method of stationary phase 
The method of stationary phase is a somewhat simplified variation of the method of steep- 
est descent that is useful for Fourier integrals of the form 


n= f slo] Expliplow]1] dw (3.61) 


where the amplitude g[w] depends relatively slowly upon frequency w and the phase ¢[w] 
is real. For large times t > oo, the phase of the exponential factor varies extremely rapidly 
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with frequency unless ¢[w] is practically constant. Therefore, the asymptotic value of the 
integral tends to be dominated by points of stationary phase. 


a) Develop a general formula for the contribution of a point of stationary phase to the 
asymptotic behavior of f[t]. How does one handle several such points? 


b) Suppose that ¢’[w] + O anywhere in the interval w < w < w,. Use integration by 
parts to show that 


f dolido (3.62) 


1 


is of order 17! and, hence, that intervals lacking a point of stationary phase can be 
neglected in the limit t > oo. 


c) Suppose that a wave function is represented by 


00 2 
wx, t] = 1 |: wk] Expli(kx - wt)]dk, w= a (3.63) 
2m Jo 2m 
where 
Wk] = f i dxe Ix, 0] (3.64) 


is the spectral representation of the initial wave function and varies slowly with k 
if y[x, 0] is well localized. Evaluate the intensity, WP, for x > co andt > œ with 
constant ratio € = x/f. 


9. surface waves on deep water 
One-dimensional surfaces waves on deep water in a narrow channel take the form 


vin 1] = f Gx», tyly. 0] dy + f Glx-», My, 0] dy (3.65) 
where f[x, 1] = Of [x, 1]/0t and 
Gl t] = f Suit qe (3.66) 
o W 27 


with w = V elkl (g is the gravitational acceleration). Here W[x, 0] and Why, 0] represent the 
initial displacement and vertical velocity profiles and we evaluate y[x, t] for later times, t = 
0. (See A. L. Fetter and J.D. Walecka, Theoretical Mechanics of Particles and Continua, 
McGraw-Hill, 1980.) Use the method of stationary phase to evaluate G[x, t] fort > co 
and x > oo simultaneously in the constant ratio € = x/t. Sketch and interpret this function. 


4 Generalized Functions 


Abstract. The analysis of physical systems is often simplified using idealizations 
that represent an impulse being delivered instantaneously or a charge distribution 
being confined to a mathematical point without volume. Here we develop the Dirac 
delta function in terms of limiting properties of nascent delta functions. Similar 
methods are used for related generalized functions (or distributions). 


4.1 Motivation 


Often it is useful to idealize an external force as instantaneous or a charge distribution 
as two-dimensional. For example, if the response of a system is much slower than the 
duration of an applied force, it becomes useful to employ impulse, defined as the time 
integral of the force 


ty+T/2 
I af F[t]dt = Ap (4.1) 
ty-t/2 
in a limiting process where the duration tT > O approaches zero while the peak force 
F[t,] > œ such that the product TF [tọ] = Ap remains constant. Thus, we imagine that the 
entire change of momentum, Ap, is delivered instantaneously at time tọ. If the response 
of the system is much slower than the duration of the force, it becomes insensitive to the 
detailed shape of F [t] and responds only to the net impulse Ap. There is then considerable 
latitude in choosing a convenient representation for F[r]. 
Consider a Gaussian function 
_ Expl-37] 


O[x, o] = one (4.2) 
TO 


defined with unit integral 


f oldest (4.3) 


o0 


As the width o decreases, the height increases to maintain a constant integral. Thus, we 
could represent an impulsive force by 


F[t] = lim Loft, 7] (4.4) 


such that the integral 


f ‘Fig dt =I (4.5) 


€ 
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ó [xo] 


fix] 


fix], ô [x,0] 


x 
Figure 4.1. Integration of a broad function, f[x], against a narrow function, óLx, 7]. 


over any small but finite interval, |t| < e with e > 0, provides the same impulse Z. There- 
fore, one of the representations of the Dirac delta function 6[x] is offered by the limit 


Ó[x] = lim 6[x, o] = lim 


(4.6) 
a0 2107? 


However, ó[x] is clearly not an ordinary function — this limit vanishes for any x + 0 
and does not exist for x = 0. 


Exp|- | 


x+0= lim —=2 =0 (4.7) 
ao 0 vna? 
Exp|- 6, 1 

x>0= Pl=3) > > 00 (4.8) 


V 210? T V27 


Despite the fact that the limit of the integrand vanishes everywhere but the one point where 
it diverges, the integral remains finite, in this representation constant, and converges to 
unity. We describe ó[x, a7] as a nascent delta function, a delta function in the making. 

Consider an ordinary function f[x] which is continuous at x = 0. Despite the peculiar 
divergence of 6[x], the integral 


i fixlólx] dx = f[0] (4.9) 


converges simply to f[0]. Figure 4.1 illustrates the evaluation of this integral. We assume 
that f[x] is continuous in some interval around x = O and choose o small enough to 
ensure that f[x] is well behaved throughout the domain where 6[x, o] has appreciable 
strength. As o is reduced further, the product ó[x, o ]f[x] is approximated increasingly 
well by f[0]6[x, 7] because the variation of f[x] within the width of ó[x, 7] can be made 
arbitrarily small. Consequently, we can extract the constant f[0] from the integral 


2 f > fixlólx, 0] dx ~ f[0] 1 i ôlx, 0] dx = f[0] (4.10) 
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to obtain 


T FlxJólx] dx = lim T fix]ólx, 0] dx = f[0] (4.11) 


without reference to the detailed shape of ó[x, a]. Notice that the limits of integration need 
not be infinite, just much larger than o. 

Notice that the order of integration and limit operations are not interchangeable — one 
must evaluate the integral before the limit because the limit of the nascent delta func- 
tion, Ó[x, a], is not a respectable function. Therefore, ó[x] is only defined within the con- 
text of integration against a continuous function. The Dirac delta function is an example of 
a generalized function defined in terms of its behavior under an integral. Mathematicians 
prefer to call generalized functions distributions instead, but to this author the term distri- 
bution hardly seems applicable to an entity that vanishes at all but one point. Other useful 
distributions include the Heaviside step function, the square function, and derivatives of 
the delta function. Familiarity with the properties of generalized functions can greatly sim- 
ply the evaluation of integrals or the solutions of differential equations that involve large 
disparities between the spatial or temporal profile of one component and the response of 
another. Of course, one could employ only well-behaved narrow functions and evaluate 
the appropriate limits explicitly, thereby recreating a generalized function, but who has the 
time? 


4.2 Properties of the Dirac Delta Function 


If f[x] is continuous at xp, then 


f l Flrlólx — xp] dx = flo] (4.12) 
selects the value at x,. This property is easily verified using the change of variables 

y=x09 => | sulóialar= f rios (4.13) 
and the definition 

f i slyloly] dy = g10] (4.14) 
where 

gly] = fly + xo] = el0] = fll (4.15) 
Similarly, one can prove that 

a + 0 => f flx]ólax] dx = A (4.16) 
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using 
a>0= | striotadax= +f 2 orar = 2 = A (4.17) 
a ay La a lal 
a<0= |” flsidlar|dx= f " [2 |oay = -2 -47 (4.18) 
es a Joo a a lal 
Thus, we write 
a + 0 => ólax] = Se (4.19) 


as an equality even though neither side represents a function because the replacement of 
one side with the other is operationally correct within the context of an appropriate integral. 
Notice that this result suggests that ô[x] is even; hence, it is useful to require that property 
of nascent delta functions. 

How should one interpret an expression of the form ó[x? — a°]? The interpretation is 
based upon the behavior of that generalized function under an integral. We assume that 
a is real and, without loss of generality, that a > 0. Recognizing that x* — a? has roots 
at x = +a, we can separate the integral into two contributions 


co 0 00 
i. flxlólx? — a°] dx = f FEJLE + a)(x — a)] dx + f Flxló[(x + a)(x - a)] dx 
eS =% 0 


(4.20) 
where in the first integral 
P _ fl-a] 

x> -a => x-a > la => FlxJó[(x + a(x- a)] dx = Bal (4.21) 
while in the second 

x>a=x+a> dam | fileto- adr = AE (4.22) 

0 

such that 

a+0= f "e id= Aal (4.23) 


provided only that f[x] is continuous near both x = +a and that a + 0. Therefore, from an 
operational point of view, we may write 
Ó[x — a] + Ó[x + a] 


a+0= dl -a?] = (4.24) 
[2a] 


because both sides produce the same result when integrated against any suitable function. 
(Note, however, that 6[x] is uninterpretable.) 

Finally, we can generalize this procedure to evaluate 6| gLx]| where g[x] is a function 
with an arbitrary number of isolated roots. 
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Theorem 19. Suppose that g[x] has isolated simple roots at {x,,n = 1,2,...}, such that 


x x x, => glx] = (0x8 lx,] 


with g [x,] + 0. Then 


slx- 
slee] = Y vr (4.25) 


If the roots of g[x] are isolated, we may evaluate the integral 
co Xy FE 
f flxlo[gix]] dx = Y” f Flxl6[(x — x,)8 Dy] dx (4.26) 


as a sum over contributions from each root of g[x] where e is a positive number smaller 
than the spacing between roots. Then using the scaling property 6[ax] = 6[x]/lal, we obtain 


f f[x16[ glx]] dx = y | sey (4.27) 


Therefore, we obtain an operational definition 
Ó[x — x,,] 


4.2 
lg Ix] iis 


(glx,]=0,g/lx,] # 0) = dfe] = X, 


n 


that incorporates all preceding results from this section. 


4.3 Other Useful Generalized Functions 
4.3.1 Heaviside Step Function 


A useful version of the unit step function, known as the Heaviside function, can be con- 
structed by integration over a delta function, whereby 


@[x] = f "ma (4.29) 


serves as an operational definition. A more controlled definition of a nascent step function 
is based upon a suitable nascent delta function, such that 


Olx, o] = f oly, 7] dy (4.30) 
Ol[x] = lim O[x, o] (4.31) 


If we choose the Gaussian representation, then we obtain 


Exp| -¿5| 


6[x, 7] = 3 


= 0lx, 0] = ; (i + En] =|} (4.32) 


V20 


where Erf[x] is the error function. Figure 4.2 demonstrates that the transition between 
values of 0 for x << 0 or 1 for x >> 1 becomes very sharp as o becomes small. Furthermore, 
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Figure 4.2. Nascent step functions. 


because 6[x] is even, we obtain O[0, 7] = 1/2. Therefore, the Heaviside step function 
assumes the values 


0 x<0 
Olx]=33 x=0 (4.33) 
1 x>0 


Notice that O[x] is an ordinary function despite the discontinuity at x = 0. There are other 
versions of the step function defining the value at x = O to be either O or 1, but O[x] is 
the version that arises naturally in calculations that employ the Dirac delta function. Also 
notice that once the limit o > 0 is taken O[x] becomes independent of the representation 
of ó[x, 7], depending only upon the fact that the delta function is defined with unit area 
and the properties ó[x + 0] = 0 and ó[-x] = 6[x]. 

Despite the discontinuity of O[x] at x = 0, its definition in terms of integration over a 
delta function can be used to deduce an operational definition for the derivative of O[x], 
namely 


Ol[x] = Í ôly] dy = 0'[x] = ô[x] (4.34) 


co 


where the derivative of the unit discontinuity becomes a spike of unit area. This result is 
consistent with the behavior of the underlying nascent functions 


Olx, o] = f ôly, c] dy = €'[x, 0] = Ó[x, 0] (4.35) 


00 


4.3.2 Derivatives of the Dirac Delta Function 


Can one interpret the derivative of a delta function, 6’[x]? Once again we appeal to the 
Gaussian representation of a nascent delta function to evaluate 
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x 


Figure 4.3. Derivative of nascent delta function. 


Exp|-— x x Exp] — x 
ólx, 0] = Bof] = f'k, o] = ate =- i = | (4.36) 
V 2707 ay 27 


Figure 4.3 demonstrates that this function has two equal and opposite lobes centered near 
x ~ +o with area proportional to o™!. As o decreases, these lobes become taller and 
closer together. Thus, one expects integration of 6’[x, o] against a continuous function to 
produce a result of the form 


fiol- flo] 
Oo 


f Ñ filó Lx, 0] dx oc =P 101 (4.37) 


wherein the ratio between the difference of the function on opposite sides of x = O and 
the width of the interval is proportional to the derivative of f[x] and of opposite sign. 
Although 6’[x, 7] obviously does not yield an ordinary function in the limit 7 => 0, its 
behavior under an integral serves as the operational definition of a generalized function. 
Assuming that f[x] is differentiable near x = 0, integration by parts yields 


I fls Ly 0] dx = (flxló[x, De, - L. f [xloLx, 0] dx (4.38) 


The surface terms vanish for any reasonably well-behaved f[x], while the remaining inte- 
gral reduces to f’[0] in the limit 7 > 0, such that 


f j flx}6’ [x] dx = lim f > filó’ [x, 0] dx = - f > P'lxlólx] dx =-f'[0] (4.39) 


It is now straightforward to generalize this procedure to obtain an interpretation 


T FEl” [x] dx = (-)" f” [0] (4.40) 


of arbitrary derivatives, assuming that f[x] is differentiable at least n times at x = 0. Note 

that we use the notation 

d"S[x] 
dx” á 


fl] = a" fix] (4.41) 


ered = dx 
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4.4 Green Functions 


Suppose that a damped harmonic oscillator is subjected to an external force F[t] with 
finite duration and is at rest before the onset of that force. The displacement satisfies a 
differential equation of the form 


mx[t] + 2myx[t] + mux{t] == F[t] (4.42) 


where wọ is the natural frequency of undamped oscillations, y is the damping constant, 
and time derivatives are indicated by the dot notation. The homogeneous equation has 
solutions of the form 


mi[t] + 2myx{t] + mwžx[t] = 0 = x[1] = Exp[-yt](A Sin[wt] + BCos[wr]) (4.43) 


where A and B are constants and where we assume weak damping to obtain 


Y<W =>0=x0-Y (4.44) 


Now suppose that the mass is struck at time f’ such that there is an abrupt change in 
velocity. We assume that the impulse acts during an infinitesimal time interval that is too 
short for the position of the mass to change. Thus, the dissipative and restoring forces have 
no effect and we find that the instantaneous change in velocity 


F[t] > Lo[t —t’] > Ax{t’] = A = lim(+l! + £] -il — el) (4.45) 


reduces to simply //m, where J is the impulse delivered. Recognizing that the homoge- 
neous equation applies both before and after the impulse, solutions for those intervals take 
the form 


t < t => x[t] = Exp[-yt](A_ Sin[wt] + B_ Cos[wt]) (4.46) 
t >t’ => x[t] = Exp[-yt](A, Sin[or] + B, Cos[wt]) (4.47) 


where the coefficients are chosen to satisfy the boundary conditions 


xl! - e] =x[ -e] =0 = A_=B_=0 (4.48) 


x[t’ + €] = 0,411" + €] = £ =A, = dg Cos[wt’], B, = Ea Sin[wt’] (4.49) 
m mu mu 


such that the Green function 


Exp[-y(t — £)] 
mu 


Glr, t’] = Sin[w(t — t*)]O [1 - t’] (4.50) 
represents the response of the system at time £ to a unit impulse at time 1”. This response 
takes the form of a damped sinusoidal oscillation. The step function enforces causality — 
there can be no effect before its cause. Also, notice that G[f, t’] = G[t — t’] depends only 
upon the difference between the times because the homogeneous equation has no explicit 


4.4 Green Functions 119 


time dependence and thus is insensitive to clock offsets. We could have simplified the 
algebra (omitted anyway) by exploiting this symmetry from the beginning. The response 
of the system to a disturbance of finite duration can be evaluated by using the superposition 
principle for linear differential equations to sum the effects of a sequence of impulses that 
represents the time profile according to 


00 00 


Fit] =f F[t’]o[t — 1] dt’ = x[t] = xolt] +f G[t, t’|F[t’] dt’ (4.51) 


where x[t] is a solution to the homogeneous equation 
io [4] + 2yxq[t] + woxolt] == 0 (4.52) 


Often the boundary conditions x[—co] = x[-0o] = 0 specifying that the system is at 
rest before the action of an external disturbance require x,[t] = 0; if not, xp[t] is designed 
to match the appropriate boundary conditions. To demonstrate this solution formally, let 


L= mo + any + mul (4.53) 
dt? dt 9 
represent a linear differential operator such that the equation of motion becomes 
Lot] = Fit] (4.54) 
The homogeneous solution satisfies 
Lx lt] == 0 (4.55) 
while the Green function satisfies 


LGE, t’] == 6[t -t’] (4.56) 


where the operator applies to the first argument of G[f, t’] with the second held fixed. 
Applying £ to the proposed solution, 


x[t] = xp [1] + for 1']1F[1] dt’ = Lx[t] = fat, t’|F[t’] dt’ 


(4.57) 
= fa -t']F[t']drt = F[t] 
one recovers the original differential equation. Therefore, the function 
x[t] = x,[1] + le Gt, "JF [t] dt’ (4.58) 
with 
Gt, t’] = eee Sin[w(t — 1’) ]O[t — 1”] (4.59) 


provides a solution to the inhomogeneous problem in the form of a convolution integral 
in which the displacement at time £ is determined by the net response of the system to 
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external forces at all earlier times. The influence of a force at time £” upon a later time t 
is reduced by the exponential damping factor, such that recent forces tend to matter most. 
Notice that causality is enforced by the step function in Gl[£, t’] and that we may limit the 
range of integration strictly to the past, such that 


x[1] = xolt] + f G[t, ']F[r] dt’ (4.60) 


co 


should be interpreted as the sum of effects produced by past causes for all past times up 
to the present. Nevertheless, one often extends the upper limit of integration to co for 
convenience, confident that the Green function eliminates sensitivity to the future. 

Note that our derivation was more schematic than rigorous; for example, we implicitly 
assumed uniform convergence in order to move £ inside the convolution integral. Never- 
theless, it is consistent with the spirit of generalized functions and can be made rigorous 
by replacing the delta and step functions with the appropriate nascent functions and evalu- 
ating the limits more carefully. Although the G[t, t’] obtained here is actually an ordinary 
function, it is described as a generalized function because the inhomogeneous term upon 
which it is based is idealized as a delta function. It is also important to recognize that the 
Green function depends not only upon £ but also upon the boundary conditions. The solu- 
tion to 


LGft, t’] == o[t —1"] (4.61) 


is usually not unique without specification of the boundary conditions. The version we 
derived above is often described as a retarded Green function because the effect follows 
the cause (is retarded in time), and is denoted by G™[r, t’]. If £ describes a wave equation, 
we might be interested in an advanced solution, GO, or a standing-wave solution, GO, 
The present analysis is typical of the Green-function method for linear systems. In the 
next several chapters we will develop a variety of useful techniques for construction of the 
Green functions for a variety of systems. Often there will be several equivalent represen- 
tations and the most useful representation may depend upon its anticipated application. 


4.5 Multidimensional Delta Functions 
A point mass at 7” = (x”, y”, z’) is represented by a density distribution of the form 
pix, y, z| = mó[? — 7] = mó[x — x'Jó[y — y ló[z — 7] (4.62) 


where in Cartesian coordinates the three-dimensional delta function factors into a product 
of three one-dimensional delta functions of identical form such that 


[andado ar 
id V 


m f six- 1dx | oty-y'ldy f sz- z1az (4.63) 


m Trev 
0 rEéV 
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returns the mass for any volume that contains the particle and vanishes otherwise. If nec- 
essary, we can work with a spherically symmetric nascent delta function of the form 


ó[r — 7] = lim ôf’ - 7 li gig LE] 4.64 
A al rere a 
that represents a compact charge distribution and is normalized according to 
1 rev 
f s- o]dr=} * (4.65) 
v 0 rév 


Notice that a three-dimensional delta function has dimensions of inverse volume. Of course, 
many alternative nascent delta functions with these same basic properties are also avail- 
able. 

In curvilinear coordinate systems (spherical, cylindrical, etc.), one must include Jaco- 
bian factors for each one-dimensional delta function, with respect to a particular coordi- 
nate, that match the differential volume element. For example, in spherical coordinates 
(r, 0, @) we factor the three-dimensional delta function in the form 


ON gy _ R= 7'I 


aa rr Sin[ð] rr 


¿| Cos[9] - Cos[6’]]é[¢ — $] 
(4.66) 


such that 


a ; i , 27 
fa- @r= { olr aca, ar || pe Sin[6] ao f old - ¢'] do 
L o  Sin[6] 0 


$ ; 1 27 
E f dd i 6[Cos{6] - Cos[6’]] ACos[A] { s$- 0146 
0 -1 s 


rr 


=l 
(4.67) 


where for the polar angle it is often more convenient to employ ¢ = Cos[6] instead of 6 
directly. Also note for the radial delta function we chose to express the denominator in the 
symmetric form rr’ instead of either r°? or r?, but all three forms are equivalent because the 
delta function carries the instruction r > r’. Similarly, in cylindrical coordinates (€, ¢, z) 
we find 
_ A Olé = ¿'] , $ 

OP AE E (4.68) 

Suppose that charge q is distributed uniformly on a disk of radius a in the equatorial 
plane. The charge density can then be represented by 


3q 3q T q 
pls y, z] = 2 la - rló[Cos[0]] = 53 a - rialo - z = - ¿ola - ElóLz] 


na? 7 T 
(4.69) 


where the first two forms employ variations of spherical coordinates and the last employs 
cylindrical coordinates. The step function, constructed as an integral over a delta function, 
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is dimensionless. The coefficients must be chosen to ensure proper normalization of the 
total charge for each representation of the differential volume element. Again, physically 
realizable distributions could employ nascent delta and step functions but idealized func- 
tions can be used if we are not interested in details that depend upon the precise sharpness 
of the edges or the width of the disk. 


Problems for Chapter 4 


1. nascent delta functions 
Sketch the following functions for small positive o and demonstrate that they can serve as 
nascent delta functions. 


0 xl > a 
a) ólx 0] = i 

20 xl < Co 
b) ólx0]= E 
c) lx, 9d] = 1 Sinlx/e 


d ólx,0]= g (Sinal)? 


Briefly discuss their advantages and disadvantages. Consider, for example, lim „o ó[x, 7] 
for x # 0 and also the behavior of 
lim FlxJó[x, o] dx (4.70) 


when f[x] diverges at too. Contrast, where appropriate, with the behavior of the Gaussian 
representation. 


2. Fermi distribution as nascent step function 
The Fermi distribution 


nle, 7] = (e° +1)! (4.71) 


represents the occupation probability for a fermion state with energy e relative to the Fermi 
level at temperature 7 (in energy units). Show that as 7 > 0 this function is closely related 
to a nascent step function and deduce the corresponding nascent delta function. 


3. higher derivatives of the delta function 

Sketch 6’ [x, o] and ó” [x, o]. Discuss these curves in terms of the finite-difference formu- 
las for f’’ [x] and f” [x]. 

4. derivative of delta function with monomial factor 


Use induction to demonstrate that 


y" ni JO y] n=m=0 


x" [x] = ‘ Goa (4.72) 


m>n=0 


in the operational sense of a generalized function integrated against a suitable function f[x]. 
What conditions must be imposed upon f[x]? 
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5. recoil factor for knockout reactions 

Suppose that projectile a is scattered by nucleus A producing a final state containing the 
scattered particle b, a knocked out particle c, and residual nucleus B such thata+A — b+ 
c+B. Let (£, p,) represent the energy and momentum of a particle labeled i € (a, b, c, A, B} 
with mass m, and use relativistic kinematics throughout. If B is left in a discrete state of 
mass mp, the reaction calculation often involves a recoil factor 


R= fa, +E, + Ez — E, — Ma] dE, (4.73) 


defined by an integral over a delta function that enforces energy conservation, where we 
have assumed that the target is at rest such that e, = m, and where 
E 230 2 > = 242 2 : 22 2 2 

Ez = Pg +t Mg = (Pa Pp P) + Mg with pg = €f -mi (4.74) 
according to momentum conservation. (Note that we use natural units with the light speed 
set to unity.) Find an explicit expression for R in terms of the energies and momenta of c 
and B. (Hint: the kinematics of particles a and b and the direction of ù, are fixed while £} 
and |p,.| depend upon e..) 


6. Green function for stretched string 
Suppose that a string clamped at x = 0 and x = L is under tension T and is subjected to a 
static load F[x]. The displacement y[x] satisfies 


Ty" [x] = Flx] (4.75) 


for sufficiently small F/T. Note that F is the force per unit length. Evaluate the Green 
function, G[x, x’], which satisfies 


9? 
T-3Glx x'] = 6[x —x'] (4.76) 
Ox 
and express the general solution for an arbitrary load distribution in terms of the Green 
function. 
7. Green function for damped oscillator 
A damped harmonic oscillator satisfies a differential equation of the form 
mile] + 2myx{[t] + mwpx[t] == Flr] (4.77) 


where m is the mass, y is the damping parameter, wọ is the frequency of free undamped 
oscillations, and F is an additional external force. Evaluate and graphically compare the 
retarded Green functions for underdamped (y < wọ), critically damped (y = wọ), and 
overdamped (y > wy) systems. 


8. response of damped oscillator to step using Green function 
a) Use the Green function for a damped oscillator to solve 


X(t] + 2yi[1] + wkx] = £,011] (4.78) 


for a step function. Find explicit solutions for the underdamped, overdamped, and 
critically damped cases. Compare these three solutions graphically and explain their 
behavior. 
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b) Combine two of these solutions to determine the response to a square pulse 
X[t] + 2yx[t] + wxlt] == f,(©[¢] — Olt — 7]) (4.79) 


Plot the solutions for y/w, = 0.5, 1.0, 2.0 with 7 = 10/w, as functions of wot and 
explain their general characteristics. If this were an RLC circuit, under what condi- 
tions would the output follow the input most closely? 


5 Integral Transforms 


Abstract. Integral transforms find widespread application in the solution of differen- 
tial equations and the construction of Green functions that describe the response of 
linear systems to external influences. We illustrate these methods using the Fourier 
and Laplace transforms, but the technique is more general. Several practical exam- 
ples of the use of Fast Fourier transforms are also included. 


5.1 Introduction 
The function f[z] defined by 


b 
fiz] = f atK{[z t] fle] (5.1) 


is described as an integral transform of f[t] with respect to the kernel K[z, t]. The range of 
integration must also be specified in the definition of the transform. The variables ¢ and z 
are then described as conjugate to each other. It is useful to represent the integral trans- 
formation by a linear operator K acting on the function f to produce a new function f 
according to 


b 
E = fla f dtK(z, eflt (5.2) 


a 


where the functional relationship on the left-hand side is uncluttered by variable names. 
When it is useful to indicate symbols for the pair of conjugate variables, we can write 


fle] = Kflz] = KIF or $= KIA (5.3) 


depending upon which variable is of interest. If the integral transforms for two func- 
tions f and g both exist, the transform of a linear combination 


h=af + bg => h=af + be (5.4) 


is the same linear combination of their transforms because K is linear. Similarly, the 
inverse transformation can be represented by 


fer = fit] = fau, fa (5.5) 
C 


where K~! represents the inversion kernel (not the reciprocal of K!) and C is an appropriate 
contour. Thus, the inversion is also represented by a linear operator K!. 
Graduate Mathematical Physics. James J. Kelly 
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An infinite variety of integral transforms is possible, but the most useful are listed 
in Table 5.1. Note that we use the same tilde notation for the transformed function and 
trust that the intended operator will be clear from the context. The following sections dis- 
cuss the most important properties of these transformations and present examples of their 
application to problem solving. One often finds that application of an integral transform 
to a linear differential, integral, or integro-differential equation produces a much simpler, 
often purely algebraic, equation. Therefore, this technique finds widespread application to 
problems involving the response of a linear system to some external influence. 


Table 5.1. Common integral transforms. 


Type Forward transform Inverse transform 
Fourier f[w]= i dte” fit] fit] = Ł dwe Flw] 
Bessel f,{k] = J dax ikfid] flx] = f dkkJ, [kx] f,[k] 


Y+Hioo 


Laplace fis] = f dre flr] fle] = ES dse" Fis] 
Mellin — fz] = ff dt" fir fid= 4 E da~ Fiz] 
Hilbert fiz] = 2 f yi afl fil = 2 E az 


5.2 Fourier Transform 
5.2.1 Motivation 


We assume that you are already familiar with Fourier sine and cosine series and their value 
in the analysis of periodic phenomena. Suppose that f[t] is a piecewise smooth function 
for -7 <t< Z. By piecewise smooth we mean that there are, at most, a finite number 
of distinct discontinuities within |t| < 7/2 and that f[t] has continuous first and second 
derivatives in each interval between discontinuities. The Fourier theorem then tells us that 
at any point where f[f] is continuous, the series 


fit = E + 2 a, Cos[w,t] + 2, b, Sinfw,f], w, = E (5.6) 
2 7/2 

a, == f dt Cos[w,1]/11] (5.7) 
Tara 
2 T/2 

bp =2 f dt Sin[e, t fIr (5.8) 
Tira 


converges to the original function. We will not repeat the proof of this theorem here, but 
instead will use Euler’s formula to combine these series using exponentials with complex 
arguments, which often simplifies their manipulation and facilitates generalization to the 


5.2 Fourier Transform 
Fourier transform in the limit T > oo. Thus, 
1 
Cos[w,t] > 5 (Expliv, t] + Exp[-iw,t]) 


Sin[w,t] > a t] — Exp[-iw,t]) 


gives 
a 1< i , 
fit] = T +5 2, a, (Exp[iw,t] + Exp[-iw,t}) 
1 co 
+ > Dir (Expliw t] — Exp[-iw t}) 
= 04 0 - ib, ) Expliw,t] + Na + ib,) Exp[-ic,t] 
= 2 2 n n p n 2 n n p n 


n=1 n=1 


Recognizing that 


W_, =-0, = b_,=—b,, by=0 


=n n =n n 
we can combine these series by extending n to the negative integers, such that 
a, + A 
fit] = > (an + ibp) Exp[-iw,t] 
n=-—00 


Finally, we define 


a. + ib pp pe 


c => n= al dt Exp[iw,t]f[t] 


2 T/2 


to obtain the complex Fourier series 


00 


> c, Exp[—iw,t] 


n=- 


1 T72 
Cy zÍ dt Expliw,t) f[t] 
T Jr 


fit] 


subject to the same conditions as the sine and cosine series. 
Note that if f[t] is real, then 


=f >ġ =c 


=n 
Similarly, for even or odd functions one obtains the symmetry conditions 


fi-t] = fit] = Cn = Cy 
Fi] = - flt] = Ca = Cy 
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(5.9) 


(5.10) 


(5.11) 


(5.12) 


(5.13) 


(5.14) 


(5.15) 


(5.16) 


(5.17) 


(5.18) 
(5.19) 
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such that the combined reality and reflection symmetries become 


(5.20) 
(5.21) 


real, even = c} = C, 


real, odd => c} = —C, 


Therefore, the complex Fourier series contains the cosine series for even or sine series for 
odd functions as special cases. For more general functions, the coefficients are complex 
because both sine and cosine terms contribute in unequal proportions. The complex series 
is completely equivalent to the original Fourier series, but it is often easier to sum the 
exponential form than the trigonometric form. 

It is important to recognize that the Fourier series is periodic by construction. Hence, 
if f[t] is not actually periodic, the Fourier series constructed for the interval |t| < T/2 
does not represent the original function outside that interval. To produce a good approx- 
imation to a nonperiodic function over a wider interval, one must increase T. Thus, a 
faithful representation of a nonperiodic function requires the limit T > co where the spac- 
ing between frequencies becomes infinitesimal and the summation over w, becomes an 
integration over the continuous variable w. The Fourier series then becomes the Fourier 
transform described in the next section. 


5.2.2 Definition and Inversion 


A pair of functions related by 


j dw —iwt F 
fil = f mE flo] (5.22) 
flo] = f dte™ fit] (5.23) 


are described as Fourier transforms of each other. The apportionment of two factors of 
(271? between these definitions is purely a matter of convention — some authors multiply 
both integrals by (277)7!?, but I prefer to apply both factors to one of the integrals. It is 
often convenient to employ the operator notation 


f= Ff => Ffiol = dl dt f(r] (5.24) 
= = *d das 
e F' [Fio] = f ae flo] (5.25) 


where the more detailed version includes the input function as an argument and indicates 
the output variable explicitly. 
These forward and inverse Fourier transforms are related by the orthogonality relations 


f dt OY = 27 ólw-w'] (5.26) 


i dw e = 27 [t —1"] (5.27) 
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Figure 5.1. Nascent delta functions. 


that express the orthogonality of Fourier components with different frequencies. Although 
these formulas can be derived by careful analysis of Fourier series when the interval 
approaches infinite length, we prefer to employ the method of generalized functions. Con- 
sider the integral 


R . 
ik Sin[kR 
f Pio (5.28) 
=R C 
whose result is proportional to one of the nascent delta functions 

1 Sin[kR 

A Pa (5.29) 
m k 


discussed in the previous chapter, except there we used o = R7! > 0 and here we use 
R > oo instead. This function is sketched in Fig. 5.1 for two choice of R. The area of this 
function is obtained from 


œ Q; œ „kR 
f me dk = Im? Í = a| (5.30) 


00 00 


where we use the principal value of the exponential form because the singularity in the 
original form is removable. Closing with a great semicircle in the upper half-plane, which 
does not contribute, the integral reduces to i times the unit residue of the pole on the real 
axis, such that 


IN an dean (5.31) 


00 


Thus, the area is independent of R but becomes increasingly concentrated in a central lobe 
around k = 0 with height 
. Sin[kR] 
lim = 
k30 k 


R (5.32) 
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and width 7/R > 0 as R > oo; furthermore, the secondary oscillations are much smaller 
and their areas cancel almost completely. Therefore, we identify 


R R 
Í e dx = 2n6[k, R] => lim f e'* dx = 2n6[k] (5.33) 
-Ñ >00 =k 


to obtain the fundamental result 
f e dx = 27 6[k] (5.34) 


that underlies the theory of Fourier transforms. 

For the sake of completeness we should mention the conditions required for the exis- 
tence of a Fourier transform — the Fourier transform flo] exists if f[1] is piecewise smooth 
and is absolutely integrable, meaning that 


f [f[t]| dt < co (5.35) 
exists and is finite. Then 
Jlo] = 1 dt e” fit] (5.36) 


is uniformly convergent by the comparison test and f[w] satisfies the properties outlined 
below. Rigorous proofs of the Fourier theorem can be found in specialized texts, but are 
beyond the scope of ours. Furthermore, we sometimes find that it is possible to relax 
some of these conditions if we are willing to employ generalized functions. For example, 
below we will derive the Fourier transform of the step function even though it violates 
the condition on integrability. Thus, these conditions are sufficient, but not always strictly 
necessary, for use of the Fourier transform. 


5.2.3 Basic Properties 
5.2.3.1 Phase 


If we require that w be real, then inspection of 


Flo] = f 7 dte” fitl = Fio] = f 7 dte” FA (5.37) 
demonstrates that 

F = fN = Flo] = fio (5.38) 

FIN =-fl = Flo] = -Fio (5.39) 


such that the Fourier transform of a real function is even while the Fourier transform of an 
imaginary function is odd. 
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5.2.3.2 Reflection 

If f[t] has a definite reflection symmetry, such that 
fed =+f10 = Fi-a] = +flo] (5.40) 
fl-t] = -fi = fl-e] = -flo] (5.41) 


then f[w] shares that symmetry. One can then use either the cosine or sine transforms 
for w > O andt > 0, as discussed in a later section. 


5.2.3.3 Shifting and Attenuation 
Using 


F| flt -1]] = f ° dte” f[t — t] = f j dt’ e+ fir] (5.42) 


00 00 


one finds that the effect of a time shift on the Fourier transform is an additional phase 
factor, such that 


F| flt -1]|= e F| fir] (5.43) 


Alternatively, suppose that an exponential damping factor is applied such that 


Fle fi] = Í drehe” = Jlo + iy] G) 


00 


shifts the frequency into the complex plane. 


5.2.4 Parseval’s Theorem 


The integrated product of two functions is related to the integrated product of their Fourier 
transforms as follows. 


eS me e d aes “duo ., 
f _ ds flelg = f di f a f ce 
7 Í j = f O o añito (5.45) 
o 20 J- 2H 
dwr o, 
= f olga] 


If we know that f[t] and g[t] are real-valued functions, we can use the phase properties of 
Fourier transforms to express Parseval’s theorem in the form 


o [oe] d PA 
f. greal => f dt figit] = { 5, lila! (5.46) 
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such that 
f real => T di fra? = T ep = I. fte] 
3 oo 20 des 25 


* dw, ~ 3 
=9 sai 
{ zy Pl 
where the squared modulus is often related to the power absorbed or radiated by a system. 
Parseval’s theorem then has a simple interpretation — the total energy (power integrated 
over time) is the sum of the power found in each interval of frequency. Thus, |Flw]|? is 


proportional to the spectral power density or power radiated per unit interval of frequency. 
Naturally, there are variations of the normalization convention. 


(5.47) 


5.2.5 Convolution Theorem 
Suppose that f = g @ his defined by the convolution integral 

fil = f glt 1 1h[0] dt’ (5.48) 
where g[t — t’] represents the contribution to f[f] made by a source h[1”]. Convolution 
integrals are found throughout mathematical physics and are often easiest to evaluate using 


Fourier transforms. The Fourier transform of f can be related to the Fourier transforms of g 
and h, as follows. 


flo] f É dte" f > dt’ gft — t’ Jh{t’] 


f dt f dt'e git — 1111] 


y Í E J T Aegi -Ne h) (242) 
= (A dse*e{s}} h TA) 
= gloi] 


Notice that we inserted a factor of unity, represented as e9 e", into the third line in 
order to factor the double integral. This useful trick often simplifies integral transforms. 
Thus, we find 


Fig @ h] = AGA = flo] = ¿[w]h[w] (5.50) 


assuming that both and h exist. Therefore, the Fourier transform of a convolution of two 
functions is the product of the Fourier transforms of the individual functions. Be aware, 
though, that other conventions for normalization of the Fourier transform result in different 
factors of 27 in Parseval’s theorem and/or the convolution theorem. 
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Observe that the correlation operator is commutative, associative, and distributive 


h®g=g@h (5.51) 
¿8(1M8p)=(28M8 p (5.52) 
g8 (h+p)=880h+88p (5.53) 


if all required integrals exist. These relationships can be demonstrated directly using the 
temporal definition, but almost trivial using the convolution theorem because multiplica- 
tion shares these properties. 


5.2.6 Correlation Theorem 


Suppose that the functions g[t] and A[t] become similar if their relative time scales are 
shifted to match their shapes as closely as possible. We would then describe those functions 
as correlated, at least over a finite range of times. A quantitative measure of the degree of 
correlation between two functions, g[t] and h[1], is provided by the cross correlation 


Cle, hlii] = Í > elt + 41101 dt’ (5.54) 


where the complex conjugation of one factor ensures that the correlation between complex 
exponentials 


g= e™, Aft] =e => Clg, h] = 2n6(w, — w) Exp[-iw,(t -¢)] (5.55) 


reduces to a delta function in frequency with a phase that expresses the time shift. Although 
convolution and correlation are different, their similarities suggest that the Fourier trans- 
form probably provides a simple theorem for correlation also. Thus, if we let f[t] = 
Clg, h], we find 


Flo] = T dte” {. dt' glt ++1]h[1 1 


= T dt L dt'e g [t + t]he T 
= f di f ADE NETA ALT) (336) 
= ( f > segs] ( f > areni) 
= kwilo] 
Therefore, the correlation theorem takes a form 
Clg. h] = lolio (5.57) 


that is similar to the convolution theorem, but complex conjugation of one factor does 
have important consequences. Although correlation is associative and distributive, it is not 
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commutative: 
CIA, gllt] = C*Lg Al[-t] = CTA, gl = Clg, hl" (5.58) 


An important special case of correlation is the autocorrelation function, C[g, g], that is 
sensitivity of periodic or quasiperiodic behavior. The autocorrelation is obviously strongest 
at tf = 0 and would also exhibit peaks at multiples of the fundamental frequency for a 
periodic function, similar to the Fourier series. The closer to true periodicity the narrower 
such peaks would be. Thus, the autocorrelation function provides an important empirical 
tool for the study of dynamical or statistical systems. Furthermore, the Fourier transform 
of the autocorrelation function 


Cle, el = [aw]? (5.59) 


is proportional to the power spectrum. This result is known as the Wiener—Khintchine 
theorem and is important to the analysis of coherence in optics or the relationship between 
fluctuations and dissipation in statistical physics. 


5.2.7 Useful Fourier Transforms 


5.2.7.1 Gaussian 


Suppose that 
Exp|- a2 >| 
fid = > (5.60) 
2107 
is a Gaussian of width o and unit area. The Fourier transform 
Ff a ‘J fol y | å | 
= == e xp| === 
V 2 20 
ane (5.61) 


eee ae _(t- iwo" oe 
f ae -r | 


can be evaluated by completing the square; we have already demonstrated that such an 
integral can be treated as if the argument of the exponential were real. Thus, we obtain a 
Gaussian 


Ff = Exp[-w?0?/2] (5.62) 
with width 07! and area V27/c”. Notice that 


i f[e] dt = 1 => F[0] = 1 (5.63) 
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and that the widths of the two Gaussians are reciprocals of each other. For a generic func- 
tion g[x] it is useful to define moments 


M, =f glo” dx (5.64) 
such that 
M M. 
x=Q)=—, o =(œ-?)= 2 (5.65) 
Mo l ) Mo 


represent the mean and variance of g. Therefore, the variances g, and ©, of a Gaussian and 
its Fourier transform are related by 


0,0, =1 (5.66) 


w 


Similar relationships are observed for other functions that can be described as a localized 
peak — the narrower the distribution in one variable (such as time), the broader the distri- 
bution in the conjugate variable (such as frequency). The Heisenberg uncertainty principle 
in quantum mechanics is closely related to this reciprocal relationship between widths of 
a function and its Fourier transform. 


5.2.7.2 Chopped Sinusoid 


Next, consider the chopped sinusoid 


0 t<0O 
fi] = )Exp[-iw ot] O<t<T (5.67) 
0 T <t 
for which 
z oe Sin[(w — w,)T] 
w] = HA = 37 AAA 5.68 
wl = f l OE (5.68) 


is the nascent delta function plotted in Fig. 5.2. Although the Fourier transform of a finite 
wave train is peaked at its fundamental frequency, wọ, the limitation to a finite time inter- 
val +T spreads the central peak and produces substantial subsidiary peaks or side-bands. 
These characteristics are similar to those of single-slit diffraction: instead of chopping a 
wave front in space we are now chopping a wave train in time, but the mathematics is the 
same. 
An important special case is the rectangular pulse, for which wg > 0, such that 
E ETE Sin[wT] 
fle] = O[T - tell = flo] = f drei MO 


-T 


(5.69) 


clearly exhibits the reciprocal relationship between the widths of a function and its Fourier 
transform. 
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Pa ES WoT =4 


15t I i ===- wT=10 


w/wo 


Figure 5.2. Fourier transform of chopped sinusoid: Fle OIT <t< T]]. 


5.2.7.3 Step Function 


Often it is useful to idealize a quantity, such as a force or voltage, that has a rapid onset 
followed by a sustained level using a step function, but the corresponding Fourier trans- 
form requires special handling as a generalized function because the step function is not 
absolutely integrable. First we consider a rectangular pulse of finite duration, for which 


iot] 


FI = Olt] - Ol - t] = flo] = (5.70) 


iw 
In the limit that 7 > oo, the contribution of e” varies rapidly in phase and would tend to 
average to zero. This can be done in a more controlled fashion using a convergence factor, 


galt] = a O[r]e”" (5.71) 
such that 
zlo] = = E dt Exp[i(w + le)t ] oe 
= lim (E zeka = lim - 
e>0* i(w + i£) o =&=0* w +iE 


where we leave the limit unevaluated because we recognize that this result must be treated 
as a generalized function. Hence, we find that the Fourier transform of ©[f] is given by i/w, 
but this is not a rigorous proof and we must verify that the inverse Fourier transform does 
provide the desired result. However, because the pole at the origin is on the integration path 
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In[w ] Im[w ] 


Re[w] 


Re[w] 


Figure 5.3. Contours used in Eq. (5.74) for Fourier representation of step function. 


for the inverse transform, we must evaluate its contribution carefully. Thus, the inversion 
integral 


x co —iwt 
lim =S : awf aD (5.73) 


e>0* 207 J œ W + iE 1 t>O 


with a pole in the lower half-plane just below the real axis for e > 0*, where closure is 
made either up or down according to the sign of t, does indeed produce a step function 
as sketched in Fig. 5.3. Therefore, the Fourier transform of the Heaviside step function 
becomes 


1 co —iwt i 7 
o= lim f e E 70 = tin 2 (5.74) 


230° 27111 wtis e0 wtie w* 


00 


where the notation w* implies addition to w of an infinitesimal imaginary part, ie, that has 

the effect of shifting the pole slightly below the real axis with the limit e > 0* taken later. 

Notice that this infinitesimal imaginary contribution is similar to the attenuation property 

of Fourier transforms, such that e represents the damping coefficient that suppresses infi- 

nite times as if we had applied a convergence factor, Exp[—ef], to the step function. 
Alternatively, consider the principal-value integral 


œ ,—iwt 1 1 0 
a f © do=1-20m=)1?2, 5 (5.75) 
2ni w 2 -4 t>0 


=o 


using the contours sketched in Fig. 5.4. Solving for 


1 p œ ¿—iwt 
el] = = - — f “do (5.76) 
2 2miJ, Ww 
and evaluating the Fourier transform of both sides, we obtain 
Í e1 Q[t] dt (5.77) 
1 co oo . P CO  —iW't 
=- f e” dt — Í dte” — el — do (5.78) 
On ee = 2ni Jo W 
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Figure 5.4. Contours used in Eq. (5.77) for Fourier representation of step function. 


where we assume that the time integration can be performed first. This result can be 
expressed in a compact operator representation as 


FO = 1Ó[0] + im (5.79) 


in which we understand that the delta function applies under a Fourier integral and that 
the principal value is to be taken for the singularity at the origin. This representation is 
equivalent to the former, except that now we imagine that the limit e > 0* is taken first 
and that the pole pushes the contour upward, making a small semicircular indentation in 
the portion of the contour that is on the real axis. Consequently, we interpret the “bump in 
the road” as 
i P 
FO = lim —— = zôļlw] + i— (5.80) 
+ le w 


e>0* W 


5.2.8 Fourier Transform of Derivatives 


Many applications of Fourier transforms exploit the simple transformation properties of 
derivatives to transform a differential equation into an algebraic equation for the Fourier 
transform. Assuming that the Fourier transforms exist for f [t] and its derivatives, the sim- 
plest method for deducing the Fourier transform of derivatives is to differentiate under the 
integral 
“dw z “dw _; fe 
fie af E flo] = FIA - | Gee “iof[0)) (5.81) 


00 00 


and thus to identify 


FİFA] = -ioF|f111] (5.82) 


Therefore, each derivative in a differential equation can be replaced by a power of (—iw) 
according to 


ô ; 
> ae (5.83) 
a — (-iw)" (5.84) 


ot” 
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5.2.9 Summary 


A brief table of Fourier transforms and their properties is given in Table 5.2. The para- 
meters T, Wo, y, 7, and A are real. We also assume that the required derivatives exist and 
integrals converge. Be careful in applying theorems for derivatives and other operations to 


functions with discontinuities. 


Table 5.2. Brief table of Fourier transforms. 


Type fie] flo] 
definition fi = E Se flo] flwl = f> dte fir] 
step function ofr] lim, o 53 = 76[w] +52 
delta function ó[t — 7] plot 
chopped sinusoid eel — Itl] — 
shifting fit-7] eT Fiw] 
attenuation e`” fit] Flo + iy] 
dilatation fu] A7! flw/A] 
convolution f= f pa drglt - tlhit] Flo] = ¿[0]h[o] 
derivatives fP (-iw)" flw] 
multiplication by powers tf[t] -if’[w] 
Gaussian Exp - 5 |[N2x0 Exp[-w?0?/2] 
exponential ev 2nd[w + iy] 
elt] i 
w + ty 
trigonometric Cos[wot] m(d[w — w] + ó[w + w,1) 
Sin[w/] im(6[w — wg] — d[w + wl) 


5.3 Green Functions via Fourier Transform 


The Fourier transform can be very useful in analyzing the response of a linear system to 
external stimuli. In this section we illustrate this technique using a few relatively simple 
examples. 


5.3.1 Example: Green Function for One-Dimensional Diffusion 


Suppose that w[x, t] represents the temperature at position x and time £ relative to the 
background temperature for an effectively infinite homogeneous system. If the initial tem- 
perature y[x, 0] = f[x] is elevated, heat will diffuse to distant regions according to the 
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heat diffusion equation 


= -— (5.85) 


where the diffusion constant x is proportional to the thermal conductivity of the material 
and inversely proportional to its heat capacity per unit volume. Thus, we expect that the 
temperature for neighboring regions will initially rise as heat passes through them but will 
eventually return to ambient levels as the energy spreads out over an infinite volume. One 
method for determining the evolution of temperature throughout this sample is to employ 
a spatial Fourier transform to write 


yix, t] = | > AN ey t] (5.86) 
-æ 20 
Wk, 1] = i j dxe yx, t] (5.87) 


such that the differential equation becomes 


Rik, t] = A = Wik, t] = Wk, 0] Exp[-xxk?] (5.88) 


The initial condition requires 


00 


tk, 0] = FIA = f dxe™ fix] (5.89) 


00 


The inverse Fourier transform becomes 
AR gy7 5 
yix, t] = nE * f[k] Exp[-x«tk"] 


2 [dk f 238 De aa 
= i oF f de fix’ Exp|- (Ktk? — ik(x — 2’))| 


The integral with respect to k can be evaluated by completing the square in the exponential, 
whereby 


(5.90) 


_ ym 
e 2] (5.91) 


Akt 


yix, t] = (4raxty 1? f dx fix] Exp - 
takes the form of a convolution integral. 

Suppose that the region of elevated temperature is initially restricted to a plane, such 
that 


E | (5.92) 


Y E 
Fl] = podla] = Ws 0 = E Exp] 3 


describes both the time and spatial temperature variation produced by a localized distur- 
bance. This special solution is an example of a Green function representing the response 
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of a linear system to a localized perturbation. The Green function for a one-dimensional 
diffusion equation with a localized initial condition satisfies the equation 


+ 14 F , 
(Ja gq] Oe =a > 


(5.93) 


xy 
Gx — x’, t] = (4rxt) 2 Exp LL x’) 


Akt 


Notice that this Green function is a nascent delta function of Gaussian form with width 
parameter o = y2xt, such that 


lim G[x — x’, t] = Ó[x-—x'] (5.94) 


1>0 


Solutions for more general initial conditions are then obtained using the convolution 


00 


yix, 0] = fix] = vlx t] = f Gix - x’, t] fix] dx’ (5.95) 


Therefore, the Green function describes the essential dynamics of the problem and reduces 
problems with specialized initial conditions to an integral which can be evaluated numeri- 
cally, if not analytically. The Green function for this problem is sketched in Fig. 5.5. The 
delta function at x = 0 must be cut off for plotting. It is clear from the dimensions of the 
original differential equation that the spatial scale must be proportional to yxt. Thus, the 
width of the Gaussian temperature profile spreads at a rate proportional to £~! as the dis- 
turbance propagates. The Green function is often described as a propagator — it governs 
the propagation of a disturbance (cause) at one location or time to a different location or 
later time (effect). The net effect of a distributed disturbance is obtained from the folding 
or convolution of the propagator with the profile of the disturbance. 


5.3.2 Example: Three-Dimensional Green Function for Diffusion Equations 


In three dimensions the initial-value problem for diffusion within an infinite uniform 
medium takes the form 


1 ovr, t] -== 217,1], y, 0] = Yl] (5.96) 
k Ot 


where [7] is the initial spatial distribution and where we assume that x is a positive 
constant. We assume that the initial distribution is localized and at least piecewise contin- 
uous so that its Fourier transform exists in all three dimensions. Using a three-dimensional 
Fourier transform 


ak 
(27) 


Wir, t] = f Wik, t] Explik - 7] (5.97) 


Wk, 1] = f Wir, t] Exp[-ik - 7] d?r (5.98) 
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G[x,t] 


Figure 5.5. Dissipation of temperature disturbance (arb. units). 


we find 


ae = 0 (5.99) 


2 2 
G4- v?) Ju= o= | (5) ) Uk 0 Exptik "ome 


If this equation is to be satisfied for any 7, we must require the integrand to vanish. Hence, 
we obtain 


ES + e) Wk, t] = 0 => Wk t] = Jol] Expl=ader] (5.100) 
where 

Wolk] = f Yol] Exp[—ik -7] d°r (5.101) 
is the initial spectral distribution. To obtain the Green function, we substitute 

Yol] = Ol - 7'1] = Volk] = Expl-ik 7] (5.102) 
such that 


Gir -7,t] = 1 Exp[-xk?1] Explik - (+ — pe (5.103) 


Qr = 


can be factored according to 


3 
ci-7,1 =| | f Exp[-xk?1] Explik, - (r; — rise (5.104) 
j=l 
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where the index i runs over the three Cartesian components. Each factor is a one-dimen- 
sional Green function for the diffusion equation; hence, using our previous result, we find 
that 

Gir -7,t] = treo” 1/2 pxpp iN ac va] (5.105) 


i=] 


depends upon spatial coordinates through |7 — 7’|. Therefore, we finally obtain 


> _ 2212 
GIP - 71, t] = (4x2 exa- ] (5.106) 
such that 
WF. 1] = f GIP- PL Wel? 1B’ (5.107) 


is the general solution for the initial-value problem for the diffusion equation in a uniform 
medium. 


5.3.3 Example: Green Function for Damped Oscillator 


The Fourier transform of the differential equation 
8 0 
(r ae + 2mY + «ee == mf [t] (5.108) 
for a driven harmonic oscillator with linear damping becomes an algebraic equation 
(mu? — 2iwmy + k)x[w == mf[w] (5.109) 


where flo] is the Fourier transform of the driving force (per unit mass). Thus, we imme- 

diately obtain a particular solution of the form 
fle] 

2 


X[w] = oy 
wh — w? — 2iwy 


(5.110) 


where wg = Vk/m is the natural frequency for free oscillation. A complete solution can 
then be expressed in the form 


> dw Ho] 
x(t] = x, [1] + em 5.111 
ea IM 2n Wi — w? — 2iwy ( ) 
where x,[£] is a solution to the homogeneous equation 
8 0 
(5 tia +08) = 0 (5.112) 


designed to satisfy the appropriate boundary conditions. Often one can assume that x) = 0 
if f[f] is of finite duration and occurs after any earlier motions have decayed away due to 
the damping term. 
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Alternatively, we recently demonstrated that a general solution to this equation can be 
expressed as 


x[1] = xolt] + f GE —1']f[4"] dt’ (5.113) 
where the Green function satisfies 
a 0 
(= + Yo + maa == O[f] (5.114) 
with 
acir] _ 


t < 0 = G(r] =0, 0 (5.115) 


Ot 


We can demonstrate equivalence between these representations by constructing the spec- 
tral (frequency) representation of the Green function directly. Assuming that 


fit = ot] = flo] = 1 (5.116) 


is a unit impulse at time ¢’ and that the mass is initially at rest, we obtain 


3 1 
Wo — w — 2iwy 
such that 
œ dw elo! 
G[t] = 5.118 
[a] f 27 wh — w? — 2iwy ( ) 


is the corresponding temporal representation. Thus, we can use either of the representa- 
tions 


x(t] = xolt] + - f > e Glw]flw] dw (5.119) 


00 


xt] = xo 11] + f Gt- rfar (5.120) 


Just to belabor the point, we substitute the spectral representation of G[t] into the second 
equation 


00 00 —iw(t-t”) 
ad= f f de e fl’ dt’ (5.121) 


27 w -=u - 2iwy 


and perform the integration over t’ to obtain 


œ dw ew 5 
w 5.122 
o 20 wr =g = iy | l ( ) 


flo] = f > dr'e fi] = xit] = xolt] + f 


as before. Alternatively, we could have substituted f[w] into the first version and per- 
formed the integration over w to obtain the second. (Try it!) 
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To complete the analysis, we must construct explicit expressions for G[t] by inverting 
the Fourier transform. First, we consider the underdamped case for which the integrand 
has two poles 


Yy < w = w, =—iy+wgp with wk = Vor -7 (5.123) 


symmetrically placed about the imaginary axis in the lower half of the complex w-plane. 

In order to keep the exponential factor finite, we must close the contour with great semi- 

circles in the upper half-plane for t < t’ or the lower half-plane for t > t’. Thus, we find 

G[t, t’] = 0 fort < t’ because there are no poles in the upper half-plane while for t > t’ 
eat 


1 Bl loe 
t> => Git, 1] = ag ¢ z > eee dw = -i (REPARA +R eee d 
s = 


(5.124) 
where the negative sign accounts for the clockwise contour and where the residues are 


_Exp[-iw,(¢ = t’)] 
ap 


R = 5.125 

Y < Wo => + 2Wp ( ) 
such that 

y < w => Gt] = WR Exp[-y( — ¢’)] Sin[wa(t — 1)]O [1 — t’] (5.126) 


describes damped oscillations following a sharp blow. If the damping is small the oscil- 
lation frequency is near the natural frequency and the oscillations persist for a long time 
when the poles are near the real axis. Next, we consider the overdamped case 


y> Wy) => w, =-i(ytK) with k=x Y - 0 (5.127) 
where both poles are found on the negative imaginary axis with residues 


_Exp[-(y + V(t -1’)] 


R, = 12 
y > w = R, =F z% (5.128) 
such that 

y > w = Git t'] = K! Exp[-y(t — t’)]Sinh[«(t — t)]O[l — 1] (5.129) 


describes the damping of the initial velocity. (Note that y > x.) Finally, for the critically 
damped case the two poles coalesce into a single double pole with residue 


y = w = R = -i(t — 1”) Exp[-y(t —1”)] (5.130) 
such that 
y = w = GE, t’] = (t — 1) Exp[-y(¢ — 1]O [1 — 2] (5.131) 


These results are summarized in Table 5.3. Notice that the underdamped and over- 
damped solutions are related by the replacement wz > ix as y increases and that the crit- 
ically damped solution is related to either of the other two by the limit wg > 0 ork > 0. 
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Figure 5.6. Green function for damped oscillator. 


Figure 5.6 compares these functions for representative values of y/wy. All are damped 
exponentially but the underdamped solution also oscillates while the critically damped 
and overdamped do not. The amplitude is suppressed for the overdamped oscillator, but 
the duration can be long because the response is slow. Notice that Sinh[xf] contains expo- 
nentially growing and decaying components, but that because x < y the net effect is damp- 
ing with a relatively slow asymptotic form proportional to Exp[—(y — x)t]. By contrast, 
the critically-damped oscillator has a fairly strong response for a more limited duration 
because the damping is sufficiently strong to suppress oscillations but weak enough to per- 
mit a significant response to the impulse. 


Table 5.3. Green functions for damped oscillator. 


Type Condition Gft, t’] w, 
underdamped Y < w wa! Exp[—y(t — t’)] Sin[w,(t -rO -r'] -iy + wp 
overdamped Y > Wo K! Exp[-y(t — t’)] Sinh[x(t — ¢’)J@[t — 1] —i(y +K) 
critically damped y = w Exp[-y(t - tt - MoO[t -r'] —iy 


It is also of interest to consider the Green function for an ideal undamped oscillator 
with y = 0. This function can be obtained by evaluating the limit y > 0* in which the 
poles for an underdamped oscillator approach the real axis from below, whereby 


y > 0* = Gl1,1'] = o3! Sin[wy(t — t1Ofr —1"] (5.132) 


Ss Tag : y -1 ; aii a 
exhibits undamped oscillations with amplitude wo following a unit impulse. However, if 
we were to evaluate the Fourier integral 


, > dw eiL) 
=00 w — ww 
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directly, we would need a prescription for handling the poles on the real axis. In order to 
obtain a physically sensible result in which effects (oscillation) follow causes (impulse), 
we need to exclude the poles for £ < 1” and include them for t > t’. This prescription is 
realized by the substitution wọ > wọ — ty that recognizes that any real physical system 
should have a damping mechanism that will shift its resonances slightly below the real 
axis and provide convergence for the delayed response of the system. Of course, we must 
have y > 0 to ensure damping rather than spontaneous growth of small perturbations. 


5.3.4 Operator Method 
The differential equation for the damped oscillator can be expressed as 
9? 


- . ZE ð a 
Lit] = fit] with £= ap +2y +0 (5.134) 


where £ is a linear differential operator whose Fourier transform becomes 


È = -w - 2iwy + w => Li = f (5.135) 


The Green function represents the response to an impulse whose spectral representation is 
a constant, such that 


£G6=1—>G=f' (5.136) 
Therefore, a formal solution can be expressed in the form 
X[w] = Xp[w] + Glo] f[o] with Li, =0, G= Lf! (5.137) 


where X, is in the null space of the operator Í. The zeros of £ correspond to the poles of G 
where 


£=0 => w= -iy + (w2 - y)? (5.138) 


Thus, the poles of G in the complex frequency plane represent the resonances of the sys- 
tem. The oscillation frequency is determined by the real part and the damping by the 
imaginary part of the complex resonant frequency. Confinement of the poles to the lower 
half-plane is required by causality, such that effects follow causes, and the necessity that 
the effect of small perturbations decay rather than grow with time. 


5.4 Cosine or Sine Transforms for Even or Odd Functions 


For functions that are either even or odd with respect to reflection, such that f[—1] = +f[t], 
it is sometimes convenient to employ Fourier cosine or sine transforms defined by 


Ff felo] = Jelo] = f di Costu} fI] (5.139) 


F| finie] = fle] = f dt Sin[wt] fit] (5.140) 
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for w > 0. The inverse transformations are obtained with the assistance of the orthogonal- 
ity relations 


Hd E A = Sólo ~ w!] + 6[w +0’) (5.141) 

f ESTE 5 (dle ~ w']-ó[w +0’) (5.142) 
such that 

fol = +f) = fin = = { ” dw Coslut lela] (5.143) 

fl-fl=—flel = fir f " dw Sinot flo] (5.144) 


Obviously, the relevant representation is dictated by the reflection symmetry of the original 
function because the inversion of the cosine (sine) transform automatically produces an 
even (odd) function of t. 

The Fourier sine and cosine transforms can also be applied to generic functions with 
arbitrary reflection properties, but in such cases we need both. Decomposing a generic 
function into symmetric and antisymmetric components 


1 
f= 5 Tel + fi- = fit] = Pr Ver (5.145) 


with transforms 


Flo] = f ” dt Cosio fO => FP = z Í “dotala 6.148) 

fs [wo] = f 7 dt Sinjon fO => fOr] = z f "dosni iw (5.147) 
one obtains 

fil = z ii "da (J lo] Cos[wt] + fs [o] Sin{wr]) (5.148) 
The usual Fourier transform is then given by the combination 

Ho] = 2(7e[e] + if Lo]) (5.149) 
The details are left to the reader. 


5.5 Discrete Fourier Transform 


The continuous Fourier transform is extremely valuable for formal analysis, but often we 
are left with Fourier integrals that cannot be evaluated symbolically. However, numerical 
analysis using computer programs is usually limited to finite intervals and to discrete arrays 
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instead of continuous functions. Alternatively, one often samples the response of a physical 
system at a set of equally spaced times and wishes to perform a spectral (Fourier) analysis 
of such data. Therefore, it is of practical interest to return to the complex Fourier series 
but now using a discrete time variable. The discrete Fourier transform finds innumerable 
applications in the physical sciences. Among them are: 


e modeling processes where the response of a system is described as convolution of a 
driving force with a Green function; 


e analysis of data where the desired signal is convoluted with an instrumental resolution; 


e identification of periodic components using a power spectrum or an autocorrelation 
function; 


e suppression of noise by digital filtering. 


In this section we will survey some of the technology that facilitates practical applications 
of the Fourier transform to problems requiring either numerical methods or analysis of 
noisy data. However, this is a very broad subject and we cannot possibly study the spe- 
cialized techniques that have been optimized for various types of applications in any real 
depth. Our intention here is to familiarize the student with some of the underlying princi- 
ples, but the researcher will need to consult more specialized literature. 


5.5.1 Sampling 


It is useful to express times and frequencies as 


t,= (¡- DAr, J=LN (5.150) 

w, =(k-lAw, k=1,N (5.151) 
where 

T=(N-1)At, Aw=21/T (5.152) 


Similarly, discretized functions become 


fi=ftl h= Flod (5.153) 
and one anticipates that the discrete Fourier transform would take the form 
x- À ar 2ri 
f= 2 , Expl-iw;t;] = 2, É Expl- o j-1)X(k- D] (5.154) 


S fa ge ni 
h= 5 YF, Explioyt ] = y > 1, Exp| eG - Nk | (5.155) 
j=l j=l 
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where we have chosen an asymmetric but convenient normalization convention for which 
the first element of the transform 


e 1 
Jiz N 2415 (5.156) 


reduces to the average value of f. Note that some authors choose indices 0 < j < N — 1 or 
1- x <j< X for even N, but we chose 1 < j < N because that is usually most suitable 
for indexing the elements of an array within a computer program. 

To verify this analogy between continuous and discrete Fourier transforms, we need to 
demonstrate the discrete version of the orthogonality relation. Substitution of the Fourier 
coefficients into the series gives 


1 


in< aes 2ni 
h= Sy Explioglty -= y Fy Exel y DG A] 6.157 
La 


c=1 j’=1 k=1 j’=1 


We are free to interchange the summation order because these are finite series of finite 
elements, such that 


N N 
1 E 
fx > fy Ney 1 (5.158) 
j=1 k=l 
where 
z„ = Exp[2aim/N] = z = 1 (5.159) 


for integer m is one of the N™ roots of unity. When j’ = j = z j-j = 1, the inner 
summation simply consists of N terms of unit value. When j’ + j, the inner sum is a finite 
geometric series 


N 1 = pal 
ye = 1... 0 (5.160) 
Zm 


that vanishes for any nonzero integer m that is not a multiple of N. Therefore, the summa- 
tion over k yields a Kronecker delta function 


id 2ni 4 
DN -D| =ô; (5.161) 


that is the discrete analog of the Dirac delta function. The summation over j’ then selects 
just the term f; from the right-hand side, yielding an identity. 

The time required for straightforward computation of the discrete Fourier transform 
using the definition above scales with N*, but much more efficient algorithms that exploit 
the periodicities of complex exponentials accomplish the same task in a time that scales 
with N log, N. To appreciate the enormous savings realized by the so-called Fast Fourier 
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Transform (FFT), consider Table 5.4. Here the array dimensions increase by factors of 2, 
the second and third columns represent the number of operations for straightforward and 
optimized Fourier transforms, while the final column of ratios shows the factor by which 
FFT is faster. Note that we have chosen N = 2” on purpose — the maximum savings are 
realized when N is a power of 2. The FFT for even N is still better than the straightforward 
algorithm but the advantage is smaller; one should avoid odd or, even worse, prime values 
of N for which FFT is no better than the standard algorithm. If your sample set is not a 
power of 2, simply pad it with some extra zeros — the computational savings far outweigh 
the cost in extra memory and, as we will see below, zero-padding is often useful anyway. 


Table 5.4. Scaling of conventional versus fast Fourier transform with sample size. 


N N? Nlog,N Ratio 
512 262 144 4 608 57 
1024 1048 576 10240 102 
2048 4 194 304 22528 186 


4096 16777216 49 152 341 
8 192 67108864 106496 630 
16384 268435456 229376 1170 
32768 1073741824 491520 2185 


The first widely disseminated FFT subroutine was developed by Cooley and Tukey 
in the mid-60s and revolutionized numerical signal processing and related fields. Similar 
FFT programs are now standard tools for numerical analysis and we assume that you can 
find one in whatever computational environment you would use. Since FFT programs are 
now so widely available that hardly anyone would consider writing his/her own anymore, 
we will not discuss the details of those algorithms here. Useful discussions can be found in 
books like Numerical Recipes (W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vet- 
terling, Cambridge University Press). 

An important feature of the discrete Fourier transform is its periodicity: 


le a (5.162) 


However, the discrete Fourier transform is intended to be an approximation of the con- 
tinuous Fourier transform that is suitable for numerical computation using computers, yet 
the primary motivation for the Fourier transform was its ability to describe nonperiodic 
functions. It seems that we have circled back to the Fourier series, but the Fourier series is 
capable, at least in principle, of describing continuous functions f[t] with arbitrary accu- 
racy (except at discontinuities) while the discrete Fourier transform represents sampled 
quantities, f., instead of continuous functions. On the other hand, arbitrary accuracy is 
not really possible numerically because an infinite number of Fourier coefficients would 
be needed, not to mention the inevitable truncation errors due to the finite number of bits 
available to the digital representation of numbers in a computer. The discrete Fourier trans- 
form represents N samples f, faithfully, whether they come from a smooth function or not, 
except for truncation errors related to machine precision. The price is unwanted periodic- 
ity and its effects upon transformations, like convolution, that require knowledge of the 
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Figure 5.7. Sampling of the temporal Green function for a damped oscillator. The region within 
vertical dashed lines is the working array. The exterior buffer zones contain zero-padding. 


underlying physical function f[t] outside its sampled range. It is useful to think of f, as 
a working array that contains sampled values f[1;] in an interior subset, / < j < m, with 
zero-padding in buffer zones on either side, such that f, = O for 1 < j< Lorm < j <N. 
The great speed of FFT means we need not be too concerned with increasing the array size 
somewhat with padding, provided that the size of the working array remains a power of 2. 
This arrangement is sketched in Fig. 5.7, where the data sample the Green function for an 
underdamped oscillator and where the interior and buffer zones are demarcated by dashed 
vertical lines. Note that the time axis is labelled by the sample index, often called a channel. 
Provided that f[t] is negligibly small in the outer ranges, as in this example, the artificial 
periodicity induced by the discrete Fourier transform would have no adverse effects on 
manipulations performed in the interior range and we would simply ignore output for the 
buffer zones. We will show a practical example of this procedure in the convolution exam- 
ple later. If f[t] does not decay fast enough when approaching the buffer zones, it might 
be necessary to impose a suitable decay by hand, but the details and consequences of such 
modifications vary for each case and would take us too far afield. The art of numerical 
computation is as interesting, important, and challenging as the more theoretical concerns 
of the present course but is beyond its scope. 

For the remainder of this section we will assume that all f; values are real, as befitting 
the sampling of a measurable quantity. The transform then exhibits the reflection symmetry 
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Thus, all f, can be reconstructed using only the first N/2 values of i the rest are redun- 
dant. Has information been lost? Not really, because N/2 complex numbers still contain 
N real numbers so that the transform is indeed a faithful representation of the input data. 
At least two samples are needed to determine both the amplitude and the phase of any 
Fourier component. On the other hand, if we think of {f;} as a discretized approximation 
to a continuous real function f[t], the limitation of the information content to the first half 
of the spectrum means that the discrete Fourier transform cannot reproduce any temporal 
variations with frequencies greater than w, = T/At, where At is the sampling interval. 
This maximum frequency is known as the Nyquist frequency and limits the accuracy with 
which discretized representations can reproduce continuous functions. If the Fourier com- 
ponents with w > w, are known to be negligible for our target function, then sampling 
works well and calculations using discrete Fourier transforms should be limited only by 
machine precision; if not, then we should reduce the sampling interval enough to achieve 
the required precision. The first step in many applications is use electronic or digital filters 
to suppress high-frequency components, which is especially useful when high-frequencies 
contain more noise than signal. 

Signals whose Fourier components are limited to w < Wmax < w, are described as 
bandwidth limited and are well-suited to the discrete Fourier transform. Unfortunately, 
if the input signal is not bandwidth-limited, high-frequency components can distort the 
lower-frequency information in the discrete Fourier transform. This phenomenon is known 
as aliasing because a component with w/w, = n will be mistaken for contribution to 
Mod[w, w,] because the high-frequency wave oscillates n times between samples. This 
nature of this problem is illustrated schematically in Fig. 5.8. Both sinusoidal functions 
have the same values at each sampled time and are thereby indistinguishable to the discrete 
Fourier transform; yet they are completely different between samples. 


5.5.2 Convolution 


The convolution of two continuous functions is defined by 


h= f 8g = hlt] = f firlelt - tT] dt (5.165) 
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Amplitude 


Figure 5.8. Schematic illustration of aliasing. The solid line represents an input signal and the verti- 


cal dashed lines the sampling times. Also shown are Cos[w_t] and Cos[2w.t]. 


where we assume that both functions vanish for |t| > co. The convolution theorem then 
tells us that the Fourier transform of a convolution 


h[w] = flw)g{o] (5.166) 


is the product of the Fourier transforms of its components. Similarly, for sampled functions 
we define 


N 
1 
h=198 => h =>) L8 40 (5.167) 
k=1 


where we assume that the meaningful values of sampled quantities are confined to the 
interior region of suitably padded working arrays. Note the index for g;_,,, accounts for 
the fact that k starts with | instead of 0. We now seek the discrete analog of the convolution 
theorem. By direct calculation, we write 


a - te 2ni 
h, = — Sh, Bxp| Ú -1k- | 
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(5.168) 
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to obtain 
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The summation over j reduces to NÓ kb which then eliminates the summation over / also, 
such that 


- Idd... 2ni 
he = y > 18% Exp|- on - Da 7) (5.173) 
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Next the summation over m reduces to Nô, , and we finally obtain 
h, = Fikr (5.174) 


as the discrete form of the convolution theorem with the present normalization convention. 

Before applying the convolution theorem to practical examples, we must consider the 
effects of the offsets used to place functions comfortably within the central region of the 
working array. Generally this means that the arrays are related to the underlying functions 
by 


fi= flt S] g; = elt; + sS] (5.175) 


where S, and S, are somewhat arbitrary time shifts in f and g, chosen to make their sam- 
pling more convenient. The continuous Fourier transforms of shifted functions 


Flo] = e 34% f 7 fite dt (5.176) 
Bw] = es E > ele” dt (5.177) 


include phase factors that depend upon the shifts. It is then useful to apply similar phase 
shifts to the corresponding discrete Fourier transforms, whereby 
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To illustrate the use of the convolution theorem with adjustable phase shifts, consider 
an underdamped harmonic oscillator with Green function G[f] and driving force F[t], such 
that the net displacement is given by 


x[t] = f Git - t]F[t] dt > x = fz (5.180) 
We have already derived the Green function using the continuous Fourier transform and 
found 


y < w = Git - T] = wp Exp[—y(t — 7)] Sin[wp(t — DJOL — T], 


A ET 


Even if we know the continuous functions f[t] and g[t], the convolution integral is usually 
too difficult to perform symbolically and we need to use numerical methods. Often we do 
not know the underlying functions and have only measurements made at discrete times. In 
either case, let f, sample the force and g, sample the Green function. For computational 
reasons we shift g, within the working array using zero padding on the left side, as shown 
in Fig. 5.9. Suppose that the driving force is a pulse with both positive and negative lobes 
centered upon t = 0. This time scale is also inconvenient for numerical computation, so 
we shift the sampled function into the working array. However, suppose that we were not 
too clever in our choice of shift and happened to place f somewhat too far to the right, as 
shown in Fig. 5.9. We then evaluate the displacement using the bare convolution theorem 
without compensatory phase shifts. The bulk of the resulting function is then rather far to 
the right of center and there appears to be a significant response for very early times before 
the driving force even becomes active. Does the model violate causality? No, this behavior 
is simply an artifact of the periodicity of the discrete Fourier transform and our injudicious 
sampling choices. After all, the Green function really vanishes for negative times. By 
placing it in the middle of the working array, the result of convolution is artificially shifted 
to the right. With a force that is also shifted to the right, the response goes past the end 
of the array and reappears, by periodicity, at the beginning. We might try placing g closer 
to the left edge of the array, but that could cause other unwanted wrap-around effects for 
functions that do not feature a sharp left edge. 

The solution to these numerical problems is to multiply Tk by a phase Exp[—271(k — 
1)S,/T | before computing the inverse Fourier transform of x. This phase compensates for 
the offset of g; and aligns the response with the driving force, as shown in Fig. 5.10. With 
a larger shift, we could also compensate for the poor placement of f, but we must not 
use a shift so large that it wraps around the left side and back onto the right. Because the 
indexing of sampled functions is merely a computational issue, we are free to adjust it in 
any manner that ensures numerical accuracy and convenience. 


(5.181) 


5.5.3 Temporal Correlation 


Suppose that two functions, f[t] and g[t], are qualitatively similar to each other, except that 
there is a time shift between them. If we had graphs of those functions we could estimate 
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Figure 5.9. Convolution without a phase shift. If either f or g is too close to the right side, periodicity 
of the discrete Fourier transform produces artificial strength near the left side of their convolution, 
an effect described as wrap-around. 


the time shift by sliding those graphs to the left or right until we obtain the best overlap 
between them. If these functions are quasiperiodic or include several features with different 
periods, there may be several shifts which result in significant overlap. The correlation 
function 


id . 
Cel y 2 Fis (5.182) 


provides a systematic method for evaluating the correlation between two sampled func- 
tions. The correlation function obviously closely resembles convolution and we can deduce 


ČLS gl Fe; (5.183) 
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Figure 5.10. Convolution using a shift of = —0.3125T aligns the response with the driving force and 
eliminate wrap-around. 


without further ado. Often one uses the autocorrelation function 


i< : - 
CLA 5 2, hah SGRIN (5.184) 
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to identify periodic behavior within a single time series. Thus, the Fourier transform of 
the autocorrelation function is simply proportional to the spectral power distribution. If we 
also employ ensemble averaging, spectral distributions in statistical physics are seen to be 
closely related to probability distributions for fluctuations about equilibrium. 

Consider the data {x,} plotted in Fig. 5.11. There appear to be three distinct bands but, 
without laborious scanning of the raw data, it is not obvious to this viewer whether or 
not there is additional structure within those bands or whether there is a pattern to how 
the visible bands are visited. Perhaps the simplest method for studying data of this type 
is to use standard mathematical software to evaluate Fourier transforms and to form the 
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Figure 5.11. Left: noisy time series. Right: autocorrelation spectrum. 


autocorrelation spectrum. This can be accomplished with only a few lines of code using 
MATHEMATICA, for example. We then obtain the accompanying figure for {C ¡Y notice the 
logarithmic scale. The strongest channel, k = 1, contains the square of the sum of (x;). 
Three other strong frequencies are clearly visible, plus four weaker frequencies. Therefore, 
these data actually contain an 8-cycle. (Note that alternation between two values, a 2-cycle, 
only corresponds to one frequency.) In addition, there is a spectrum of white noise that 
tends to obscure the patterns in {x i}. 

Now that we know there is an embedded 8-cycle, we can partition the data into groups 
of 8 and average the groups to obtain an estimated cycle (x;, j = 1,8) for which ran- 
dom fluctuations are suppressed by averaging; positive fluctuations tend to cancel negative 
fluctuations. Finally, if the underlying dynamics are periodic and deterministic, the value 
of x;,, is predicted by x;. Therefore, Fig. 5.12 plots x;,, versus x; for the cycle. We now 
see individual points that were obscured in the noisy raw data by random fluctuations. In 
a similar plot for the raw data without averaging, these pairs of points would smear out 
into indistinct blobs. These points lie within the bands in the (1;, x,} plot, but the highest 
two pairs appeared to merge into a single band in the noisy data. The lowest band was 
narrowest because the underlying pair has the smallest separation. These points appear to 
lie along a parabola, so we also show a curve ux(1 — x) where y is fitted to the cycle data. 

The data for this simulation were constructed using the logistic map 


Xizi = px (1 -x;) (5.185) 


with u = 3.55, which settles onto an 8-cycle after discarding enough of the initial itera- 
tions to allow transients to decay. We then added uniformly distributed random fluctuations 
to simulate the noise that might be encountered in the measurement of the response of a 
dynamical system. The noise amplitude was chosen to be large enough to smear the bands 
in x but small enough not to obscure the smaller frequency peaks in C. The unperturbed 
cycle for this y is actually {0.506, 0.887, 0.355, 0.813, 0.540, 0.882, 0.370, 0.828}, which 
agrees well with the extracted cycle. Thus, the upper band actually contains 4 values while 
the lower 2 bands contain 2 each, but it would take very sharp eyes, and perhaps some 
imagination, to discern that pattern within the noisy data. Nevertheless, with the aid of the 
autocorrelation spectrum we can discern the periodicities of the data and use that infor- 
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Figure 5.12. Cycle and recursion relation extracted from data in Fig. 5.11. 


mation to discover the underlying dynamics despite the superimposed noise. The present 
example is admittedly artificial, but it should be clear that this type of analysis can be very 
valuable in the study of real dynamical systems. 


5.5.4 Power Spectrum Estimation 


Estimation of the power spectrum for a persistent function f[t] that is not localized in 
time using discretely sampled measurements is a surprisingly tricky task for which there is 
much lore and literature; too much to survey here. The main problem is that the limitation 
of data to a finite interval necessarily sacrifices information about times outside that inter- 
val. Discrete sampling also limits the maximum meaningful frequency to w < w, = n/T 
for real functions or 2w, for complex functions. Finally, there are precision and noise 
issues that we will not discuss but which are important in practice. Here we will present a 
very brief survey of some of the issues in estimating power spectra but leave more detailed 
discussion to specialized texts. 

Suppose that f[t] = Exp[—iwot] represents a simple harmonic vibration with unique 
frequency wọ. Sampling necessarily limits the Fourier transform to a finite interval T. 
(Who can afford to watch the vibration forever?) Thus, the continuous Fourier transform 
over a finite interval becomes 


T > 
Fle] = Í Exp[iwt] Exp[-iw,t] dt = 2 Expli(w — agri Ie = 1/21 
0 


w — Wo 
(5.186) 
with a spectral power density 
; 2 
š 2 Sin[(w — w)T 72] 
Plw] = =(T .187 
[co] = [fo] (o-w,)T?2 (5.187) 


that peaks at wy but which is spread over a considerable range of frequencies when WoT 
is not large. Many periods of oscillation must be observed in order to measure wy accu- 
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rately, especially when the signal contains noise or measurement errors. However, actual 
measurements made are at discrete times t = T(j - 1)/(N — 1), not continuously, so we 
estimate the power spectrum using the discrete Fourier transform 
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(5.188) 


x 2ni f=d 
= > Exp y == Exp ic 1 | 


This expression takes the form of a finite geometric series that can be evaluated in closed 
form. After some tedious algebra, we obtain 


- (k-1 oT i 
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(5.189) 


The first factor is just a phase that does not affect the power spectrum. It is useful to define 
T = (N — 1)7 where 7 is the sampling interval, such that 


. 2 
= ( Sin[Nw,7/2] ) (5.190) 


Sin[w,7/2 — m(k — 1)/N] 
represents the discrete power spectrum. This spectrum exhibits a strong peak where 


k=x=1+[2% —m)w (5.191) 
277 

is close to a root of the denominator. Here m is an integer chosen to ensure that x is in the 
range | < x < N. Notice that we indicated approximate equality because k is an integer 
while x usually is not. The peak of the power spectrum then has a finite width, proportional 
to T~!, that is similar to the continuous Fourier transform of a finite wave train. When x 
actually is an integer, both the numerator and the denominator have coincident roots and 
we use L’ H6pital’s rule to determine that 


NwWot 


>K+mN-1= > f, > Nb,, (5.192) 


is consistent with the normalization of the orthogonality relation for the discrete Fourier 
transform. In principle, such a peak is limited to only one channel, but signal noise or 
numerical precision will generally produce nonzero amplitudes for nearby channels. 

The sensitivity of the power spectra to slight differences in frequency is illustrated in 
Fig. 5.13. Both figures use FFT with N = 2% = 256 channels. Notice the logarithmic 
scales. The peak at x = 86.33 for wyt = 27/3 is not integral, resulting in a noticeable 
spreading about channel k = 86. By contrast, using the very similar frequency wT = 
(27/3)(N —1)/N produces a discrete delta function at channel 86; the negligible, but appar- 
ently nonzero, power in other channels is due to round-off errors for numerical calculation 
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Figure 5.13. Power spectra for discrete Fourier transforms of Exp[—iw ft] that use wyt ~ 27/3. 


of the Fourier transform with a machine precision of 10~!°. The striking difference between 
these spectra is an artifact of discretization that has nothing to do with the nature of the 
underlying continuous function. In one case we chose, probably fortuitously, a sampling 
interval that divides the period perfectly. In real life our signals would not be pure sinusoids 
and we would not know the frequency well enough in advance to choose such a precise 
sampling interval — if we had such knowledge, we would not need to perform numerical 
analysis. Therefore, the panel on the left is the usual situation and shows that the spectrum 
is spread by sampling even for a pure sinusoidal oscillation. The panel on the right, on 
the other hand, requires precise calculations and delicate cancellations to achieve a dis- 
crete delta function. The alert reader might wonder how the discrete Fourier transform can 
produce a delta function using a finite observation time T while the continuous Fourier 
transform for the same observation time results in appreciable spread. The difference is 
that the discrete Fourier transform automatically assumes that the underlying function is 
periodic and persists forever while the continuous Fourier transform does not. 

The term —mN in the expression for x is a manifestation of aliasing — even when 
the frequency wọ is very large, there is still a peak within the sampled range of frequen- 
cies because discrete sampling cannot distinguish how many times a function oscillates 
between samples. Consequently, very high frequency contributions to a continuous signal 
corrupt the discrete power spectrum for lower frequencies. In Fig. 5.14 we chose a fre- 
quency for which w,7 = 2135 is much larger than 27N for N = 256. The peak power for 
the continuous spectrum should then be well beyond the end of this spectrum. Neverthe- 
less, we observe a peak at channel 205 corresponding to m = 339. Our sampling is much 
too coarse to obtain a realistic spectrum for this high-frequency signal. Signal processing 
cannot be performed by blind application of numerical algorithms. 

As our final example, we consider the behavior of the van der Pol oscillator, described 
by the differential equation 


xr] Ox[t] 
or ôt 
where € is a positive constant and x will be described as a displacement. This equation 


reduces to a simple harmonic oscillator when e > 0, but for positive e the velocity- 
dependent term pumps energy into the motion for small displacements and drains it for 


== —x[t] + e(1 — xit) 


(5.193) 
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Figure 5.14. Power spectra for discrete Fourier transform of Exp[-iw,f] with wT = 2135. The 
peak is an artifact of aliasing; the frequency of the actual signal is much too large for this sampling 
interval. 
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Figure 5.15. van der Pol oscillation, e = 10. 


large displacements. The net effect is to drive the system toward a stable “limit-cycle” 
oscillation that is decidedly nonsinusoidal for large ¢. This nonlinear differential equation 
cannot be solved symbolically, so we must resort to numerical methods. The solution for 
large £, long after transients have decayed, is shown in Fig. 5.15 for e = 10. The behav- 
ior is periodic but not simple, featuring relatively slow variations near either extreme, but 
with rapid transitions between those extremes. Obviously, the time sampling must be fine 
enough to represent the abrupt transitions. 

The discrete power spectrum obtained using 4096 samples in the range 200 < £ < 600 
is shown in Fig. 5.16. The peaks are found at odd multiples of a fundamental frequency, v,, 
that is approximately 21 channels. The fact that the harmonics are odd can be understood 
by observing that the nonlinear term responsible for those harmonics is odd. From the 
plot of x[t] we see that the oscillation period, p, is approximately 19 time units, such that 
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Figure 5.16. van der Pol spectrum, e = 10. 


vıp ~ T where T is the total observation time. The dashed line shows that the intensity 
of higher harmonics is roughly proportional to k 92, at least over this range. If the time 
samples represent observation of a physical system, the power spectrum can be used to 
study the underlying dynamics. We might, for example, attempt to guess a differential 
equation that has these features and fit its parameters to the observed behavior. 

After transients have decayed away, the behavior of this system is periodic. If we knew 
its functional form and period precisely, we could use the Fourier series to deduce the 
power in each odd multiple of the fundamental frequency, reducing the power spectrum 
from a sequence of peaks to a series of discrete spikes of zero width. The spreading seen 
in the discrete Fourier transform is partly due to the limitations of sampling in which the 
period is not divided into a precisely integral number of samples. We might be able to 
reduce the widths of these peaks if we knew the period in advance or could interpolate 
within the data array to improve the sampling interval. The mismatch between the sample 
interval and the period is analogous to chopping the fundamental frequency. The sharp 
turn-on at ¢ = 0 and turn-off at t = T spreads each discrete frequency into a peak of 
finite width. A closer approximation to a Fourier series of narrow peaks can be obtained 
by multiplying the sampled data by a window function w[1] that has a broad, relatively flat 
central region between smooth turn-on and turn-off regions. This technique is explored 
in one of the end-of-chapter problems. However, in real applications, additional spreading 
would be produced by measurement errors and noise. The effect of noise can be reduced by 
dividing the data into several subintervals, each containing an integral number of periods, 
and averaging the data for those subintervals to improve the signal-to-noise ratio. The 
discrete Fourier transform would then be taken for the averaged data. 


5.6 Laplace Transform 165 


5.6 Laplace Transform 
5.6.1 Definition and Inversion 


The Laplace transform £f is defined by 
Aral = fist = [arta (5.194) 
0 


and is useful for functions which vanish for £ < 0 and remain finite for t > 0, ensuring 
convergence of an integral transform that uses an exponential kernel. The Laplace trans- 
form is related to the Fourier transform with respect to an imaginary variable w > is. The 
primary difficulty in using the Laplace transform is defining and evaluating the inverse 
transformation £7*. Using the analogy with the Fourier transform, we might guess that 
the kernel for the inverse transform should take the form e7% > e*, but the exponential 
growth for £ > 0 clearly presents problems for convergence. Consider the Fourier trans- 
form of the function 


glt] =e" fit] = Bo] = Felw] = f dte” gli] = { def] (5.195) 
0 0 


whose inverse Fourier transform satisfies 


e” fit] =F [2] = > i > dwe ™™zlw] => fit] = + f > dwe""Ylw] (5.196) 
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The integral for f[t] will converge if y is large enough to ensure that e” g[s] > 0 for s > 
+ioo. The variable change s = y — iw then gives 


2111 Joico 


1 y+ioo 
fl] = =S dse"Fgli(s — y)] (5.197) 
Y 
where the Fourier transform of g evaluated for the imaginary frequency i(s — y) such that 
Feli(s — y)] = f dte”! g{t] = f die e” fit] = £Lf[s] (5.198) 
0 0 


reduces to the Laplace transform of f. Rigorous derivations may be found in more special- 
ized literature. 
Therefore, the Laplace transform and its inverse are defined by 


FIs] = £f = f dt e” fit] (5.199) 
0 

fl = £7 = 2 i. ds els] (5.200) 
21 J y—i00 


where y is a real number chosen to place the integration path to the right of all singularities 
in f[s]. The inverse Laplace transform is often called the Bromwich integral or the Mellin 
inversion formula. Although the Laplace transform was originally defined for real values 
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Figure 5.17. Contours for the inverse Laplace transform. We must choose y to place the primary 
vertical segment to the right of any singularities. Here we show just two poles; additional detours 
may be needed to avoid branch cuts in the left half-plane. 


of s, the inversion is performed in the complex s-plane using the domain in which f[s] is 
analytic. Contours for inversion of a Laplace transform are sketched in Fig. 5.17, where the 
heavy dots indicate isolated singularities. The region to the left of the vertical portion of 
the contour may also contain branch cuts, if needed to define a single-valued function f[s]. 
When t < 0 we close the contour using a great semicircle in the right half-plane, on which 
the integrand vanishes in the limit of infinite radius, and conclude that f[t < 0] = 0 
because no singularities are enclosed. This result is consistent with the requirements on f 
needed for application of the Laplace transform. When t > O we close the contour using 
a great semicircle in the left half-plane, with detours around any branch cuts. The residue 
theorem can be used to evaluate the contour integral, but to extract the inverse function we 
may need to subtract contributions of unwanted portions of the contour. 
Notice that when s > 0 the Laplace transform becomes the total integral 


s>0>£f= mar (5.201) 
0 


However, it is not necessary for this integral to exist for the Laplace transform to exist. 
Existence of the Laplace transform requires that for any positive M there exists a real 
number y such that |e~” f[¢]| < M is bounded for large t. A function satisfying this condi- 
tion is described as exponentially bounded. Thus, the Laplace transform of e% for positive 
A exists in the domain Re[s] > A even though its total integral does not. On the other hand, 
Exp[1?] does not have a Laplace transform because it is not bounded by an exponential 
of t. Nor does the Laplace transform exist for £” when n > 1 because the transformation 
integral is divergent at its lower end. 
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5.6.2 Laplace Transforms for Elementary Functions 


Let Oft] represent the unit step function 


O[t < 0] =0 (5.202) 
O[t > 0] = 1 (5.203) 


whose Laplace transform 
S 1 
LO = f dte“ @[t] = — (5.204) 
0 S 


exhibits a simple pole at the origin with unit residue. We will leave the value at £ = 0 
undefined for the moment, and evaluate it using the inverse Laplace transform. The inverse 
transform for £ + 0 is obtained using 


1 y+ico e’ 
ofz] = =Í ds— (5.205) 
2711 Jy—ic0 Ss 
where y > 0 may be taken arbitrarily small. Consider the contour integral 
1 St 
— Q ds— (5.206) 
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where C consists of a vertical line in the right half-plane closed by a great semicircle of 
radius R where R > oo. The only singularity of the integrand is found at the origin. For 
t < 0 we choose a great semicircle to the right to ensure that closure does not contribute 
and that the contour integral reduces to the inverse Laplace transform; hence, the inverse 
transform for t < 0 vanishes because no singularities are enclosed by C. For t > 0 we 
must close in the left half-plane and include two short horizontal segments represented 
by s = yf + iR where 0 < f < 1. The contributions of these segments vanish in the limit 
Expl(y8 + iR)t] 


R > œ because 
e 1 
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Thus, the contour integral again reduces to the inverse Laplace transform but this time has 
the value unity because a single pole with unit residue is enclosed. For t = 0 we must be 
more careful because the inversion integral takes the form 


e” 
<y >0 (5.207) 
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and the contribution of a great semicircle does not vanish. On the contrary, if we close to 
the right 


1 d i free Aor 1 1 
— gh Z = lim aa | Eee do) = @[0] - + = 0 = 0[0] = — 
2111 cs Roo 2711 y-iR S 2n n/2 2 2 


(5.209) 
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where the short horizontal segments can be neglected and where one choice encloses a 
pole, while the other does not. Either way, we find that the consistent definition of the step 
function in terms of the inverse Laplace transform of s”! requires Oft = 0] = 1/2. 

The same result can be obtained using y > 0 and a contour C consisting of the imagi- 
nary axis and an infinitesimal semicircular detour around the origin that ventures into the 
right half-plane. For t < O we again close C to the right while for £ > 0 we close to the left 
and recover the previous results. For t = O we obtain 


St 00 d 1 

¡=0= as =mi =P | % +i = 0[0] = - (5.211) 
Cc S ioo S 2 

and is composed of a principal-value integral that gives the inverse transform and the 

contribution of the semicircular detour. Therefore, combining these results we obtain 


1 y+ioo e" 
= f ds— = Oft] (5.212) 
2nt a sS 


—i00 


and recover the original function. Obvious, perhaps, but the unit step function is an implicit 
factor for any function for which one applies the Laplace transform because the require- 
ment f[t < 0] = O can be met by the replacement f[t] > f[t]O[t]. However, if f[0] + 0, 
there are sometime subtle problems in limits t > 0* related to the value of O[0]. These are 
usually harmless, but may require careful analysis. 

Next consider an exponential function for £ > 0 whose Laplace transform 


fit] = e“0[1] = FIs] = f dte“e" = — for s > Re[a] (5.213) 
A = 
exhibits a simple pole with unit residue at s = a. If the real part of œ is positive, the pole 
is in the right half-plane. Thus, poles in the right half-plane correspond to exponentially 
growing functions and convergence of the inverse Laplace transform requires y to be suf- 
ficiently large to overcome the strongest of these exponential features. Poles in the left 
half-plane correspond to exponentially decreasing contributions which pose no difficulties 
for the inversion procedure. The imaginary parts of these poles correspond to oscillatory 
features in the original function. The unit step function without an exponential factor is 
equivalent to an exponential with œ = 0 and contributes a pole at the origin. The Laplace 
transforms for trigonometric functions are easily deduced from that for an exponential, 
whereby 


= 5.214 
+0 ( ) 


[sintoni] = =( : : = 


S-IlW s+10w 
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if 1 5 
£|Cos[wr]| = sl: a =) => 3 (5.215) 
Notice that when 
1 
s > 0 = L[Sinfor]] > E,  L[Cosfwr]] > 0 (5.216) 


we obtain finite values for the improper integrals 


{ " @ Silo == (5.217) 

0 w 

1 atio (5.218) 
0 


that are consistent with earlier results obtained with the aid of a convergence factor (see 
Sec. 2.2.2). 

If the Laplace transform is a rational function, often the simplest inversion method is 
to expand in partial fractions and sum the resulting exponential functions. For example, 
given 


= sS 1 a b 
= = + 5.219 
Hs] (s — a)(s +b) ala Sy ( ) 
we can immediately determine 
ae" + be”! 
fl = 5.220 
Flo] Tih ( ) 
without performing contour integration. 
The Laplace transform has a simple shifting property 
fis - a] = | dte ®® fit] = Lie“ fin] (5.221) 
0 
which permits one to deduce 
w 
e” Sin[wt]} = ———.—, 5.222 
L| [wt] TE IT (5.222) 
+ 
Lje” Cos[wr]] = —— (5.223) 


(s+ y? + 


effortlessly. Conversely, if we apply a step function to f[t] to ensure that the integrand 
vanishes for £ < 0, as required for the Laplace transform, multiplying the transform by an 
exponential 


es] = 1 die OD fit] = {- dte™ flt- a] = m dte ™ fit — a]O[t — a] 
0 a 0 
(5.224) 
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has the effect of shifting the integrand. Hence, we find 
Ll flt - aJOlt - a]| = e% Fis] (5.225) 
Finally, consider the delta function 6[t — tọ] with tọ > 0, for which 
Ll 6[t — tol] = e” => Lle] = dt - to] (5.226) 
In the limit t) > 0*, we find formal results 
ty > 0* => L[6[e]] = 1 => £1) = slt] (5.227) 


that are often useful in initial-value problems. Although this result is similar to that for 
Fourier transforms, we must remember that the Laplace transform is limited to £ > 0. 


5.6.3 Laplace Transform of Derivatives 


The Laplace transform finds some of its most important applications in the solution of 
differential equations. The Laplace transform of a derivative 


LIPIA = f dte f'U = (FS +s f dte” fir] (5.228) 


can be integrated by parts. Assuming that f[t] remains finite as t > oo or restricting 
Re[s] > y, we obtain 


LIF] = s-£[flel| - £10*] (5.229) 


where the integration constant is evaluated in the limit £ > 0* for infinitesimal positive £. 
Iteration now gives 


Lif’ [el] = L| fle] - sf[0*] - £'[0*] (5.230) 


or, more generally, 


n-1 


eff te] re- Y 99107 (5.231) 


k=0 


Therefore, when we apply the Laplace transform to a linear differential equation, the initial 
conditions become part of the resulting algebraic equation, making this technique particu- 
larly useful for initial-value problems. The Fourier transform, on the other hand, is often 
more useful for boundary-value problems. 

It is also useful to consider derivatives of the transform 


fis] = I > dte~ fit] = f'[s] = f ° dte~" (-tf[t]) (5.232) 
0 0 
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whereby 
Ó ~ 
Ll flel|ts] == Fs] (5.233) 
Thus, the Laplace transform of t” f[t] is related to £f by the operator 
I= E (5.234) 
Os 
ð n 
t — (-5) (5.235) 
Os 


Alternatively, using 


e co 
7 al doe (5.236) 


and changing the order of integration in 


T dtt 'e™ fit] = f dt i doe fit] = f do do die" fit] (5.237) 
0 0 5 s 0 


we obtain 


L| fin] = f do fio] (5.238) 


Ss 


Thus, while positive powers of t are related to differentiation, negative powers of t are 
related to integration of the Laplace transform with respect to the conjugate variable s. 


5.6.4 Convolution Theorem 


Consider a function f[t] obtained by the convolution of g[t] and h[t — 7] over the finite 
interval O < T < t, such that 


t 1 
f=g@h=heg= fl] = f elr]h[tr - 7] dt = T hirt]elt — t] dt (5.239) 
0 0 
We wish to demonstrate that the Laplace transform satisfies a convolution theorem 


Lig 8 h] = Llgl£[A] (5.240) 


which is similar to that for Fourier transforms. We assume that both g and h are expo- 
nentially bounded so that their Laplace transforms exist. The Laplace transform of the 
convolution is then given by an integral 


T t 
fis] = lim f dte™ f drglt]hit — 7] (5.241) 
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over a triangular area in the (t, 7) plane. Assuming that we can change the order of inte- 
gration 


T T 
fis] = jim f drg[t] f dte“h{t — T] (5.242) 
e Jo T 
and using the variable change u = t —7, 
T T-t 
fis] = tim f drege | due *"h[u] (5.243) 
>> J0 0 


we obtain an integral over a triangular region of the (u, 7) plane. Recognizing that h[u] is 
exponentially bounded, the relative contribution to the inner integral 


T T T 
fis] = lim f dareste f due "h[u] — f due nul (5.244) 
7 Jo 0 Tr 
made by the region T — 7 < u < T becomes negligible as T => oo. Therefore, we obtain 
Fis] = ¿ls]h1s] (5.245) 
as desired. 


The important role of the Laplace convolution theorem in the solution of inhomoge- 
neous differential equations, similar to the analogous theorem for the Fourier transform, 
will be explored soon and in the exercises. It can also be used to evaluate Laplace trans- 
forms for functions defined by definite integrals. By interpreting a definite integral as a 
convolution with the step function 


t t 
f fit'|dt’ — f Olt -11f[1] de” (5.246) 
0 0 
we obtain 
t 
1 
«| f f plar! = —L[flel] (5.247) 
0 
whenever Lf exists. For example, using 
>i 
in{t > 1 . 1 
"| = f dos = T dx z = ArcTan[s”'] (5.248) 
t A o +1 0 l+x 
the Laplace transform of the sine integral function becomes 
Po 
Sift] = { Silt a = £[8i] = ATs (5.249) 
o * 


The convolution theorem can also be used as an alternative to the partial-fraction decom- 
position. For example, we can use 
t 


fl [(s sey te + wy} =w! { e Sin[ot”] dt’ 
0 


1 (E -1) + it] Expl-o(t - 1”) — ir] ) 
0 


2iw a+ iw a — iw 


(5.250) 


5.7 Green Functions via Laplace Transform 173 
to find 
e” — Cos[wt] + © Sin[wt 
L£'[(s+ ay Us? + y] = et ae) (5.251) 


+? 


5.6.5 Summary 


A brief table of Laplace transforms and their properties is given in Table 5.5. Note that 
we assume that 1,1, > 0 throughout and that initial conditions enter for t > 0*. The 
parameters tọ, w, y, and A are real. We also assume that the required derivatives exist and 
integrals converge. Be careful in applying theorems for derivatives and other operations to 


functions with discontinuities. 


Table 5.5. Brief table of Laplace transforms. 


Type fin Fis] 
definition fil = 4 ine dse” fis] fis] = f dte" fir] 
step function ofr] s7! 
delta function O[t — to] evo 
shifting fit- 1,]0[1 — t] e fis] 
attenuation e`” fit] Fis +y] 
dilatation flat] A7! FLs/A] 
convolution fit = f elrlh[r-1]dr Fis] =2[s]hls] 
derivatives f' 1] sf[s] — f[0*] 

fin s’ FIs] — sf[0*] - f’[0"] 
integral fiar s Fs] 
powers t” Tiv + 1]/5+ 
multiplication by powers tfit] —f'[s] 

fa f Folde 
exponential eft] 1/(s + y) 
trigonometric Sin[wr] w/( + w) 

Cos[wt] ss +?) 
periodic flt+nT] = git] Fis] = (1 -e Ty de dte g[r] 


5.7 Green Functions via Laplace Transform 


Like the Fourier transform, the Laplace transform can be very useful in analyzing the 
response of a linear system to external stimuli. An advantage of the Laplace transform is 
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that initial conditions are included automatically. In this section we illustrate this technique 
using a few relatively simple examples. 


5.7.1 Example: Series RC Circuit 


The current in a simple RC circuit with a variable voltage source V[t] satisfies 


1 1 

RI[t] + E f I|t'| dt’ = vir (5.252) 
0 

where we assume that the circuit is quiescent for t < O (no current and discharged capaci- 

tor). Although this can be converted into a differential equation for the stored charge, it is 

instructive to solve the problem as an integral equation instead. The Laplace transform for 

this simple integral equation gives 


s. bi z z 
RI + =- =V => I= 
Cs 1+sT 


CV (5.253) 


where t = RC is the characteristic time constant for this circuit. Therefore, the general 
solution takes the form of a convolution 


Ift] = f Glt -— t IV [1] dt’ (5.254) 
0 


where the Laplace transform of the Green function is 


Gls] =C (5.255) 


l+st 


The inversion can be accomplished either by using the Bromwich integral or by using the 
basic properties of Laplace transforms, whereby 


=  * i? p 1 Jers -t/r 
ol alta a ae 


= Ur le Oft] + eI t]) 


(5.256) 


We can drop the step function because this Green function is only used for t > O and we 
can evaluate the coefficient of the delta function at t = 0 to obtain the simpler form 


—t/T 
Git] = rs] =e (5.257) 
T 
such that 
Rift] = Vit] - r~! f Exp[-(t — t’)/t]V[t’] dt’ (5.258) 
0 


or 


RI[t] = V[t] — pel Exp[t’/t]V[t’] dt’ (5.259) 
0 
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For example, suppose that the voltage is constant for t > 0. Then 


t 
Vit] = VO] = RI[t] = V, t-te | Ex t/t] dt’ 
[1] = Y/OIr] [1] = Vo( : p[/7] dt’) (5.260) 
= VW (1-e (e^ - 1) 
such that 


Vit] = YOR] = RI[t] = Ve” (5.261) 


Thus, the current is initially determined by Ohm’s law for R and V, but decreases to zero 
for long times as the capacitor becomes charged. Although this derivation may be unfa- 
miliar, the result should be familiar and agrees with our expectations. For more general 
V[t] profiles, sometimes it may be easier to invert the Laplace transform T[s] and at other 
times it is easier to perform the convolution with respect to time. These approaches are 
compared in the exercises. 


5.7.2 Example: Damped Oscillator 


Consider once again the initial-value problem for the damped harmonic oscillator 


8 ð 
E +2y> + oft = fit], x10] = xo, x10] = vo (5.262) 
where we consider only positive times, £ > 0, that are suitable for the Laplace transform. 
An advantage of the Laplace transform is that the initial conditions are included automat- 
ically. Thus, the transformed equation 


s°X[s] — sxo — Vo + 2y(sk[s] — xp) + 055] = FIs] (5.263) 
reduces to 

X[s] = ž [s] + š% [s] (5.264) 
where 

le Xp (s + 2y) + vo 7 Xo(s +y) Vo (5.265) 


st 2ys + wr (s+ yy + (wi, — y?) (s+ yy + (we — y?) 
is the solution to the homogeneous equation that depends upon the initial conditions and 


Fis] 


> a (5.266) 
so + 2ys + Wo 


x,[s] = 
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is the particular solution to the inhomogeneous equation that depends upon the forcing 
term. The homogeneous solution can be inverted using 


1 Sin[at 
a 3 7 _ -n Sinlar] (5.267) 
(sty) +g a 
“| = e" Coslat] (5.268) 
(sty) +g 
to obtain 
yi Vo q; 
x,[] =e (x Coslar] + -2 Sin{ar]] (5.269) 
The particular solution takes the form of a convolution 
Í 
x,[t] = f Gi -t'f ] dt’ (5.270) 
0 
where 
Sin[at 
Gl] = A (5.271) 


is the Green function. Notice that the lower limit of integration is t = O because for 
the initial-value we can imagine that the forcing term is absent for £ < 0, implicitly 
using f[t] > f[t]O[t] for the method of Laplace transforms. Also notice the similarity 
between the Laplace and Fourier representations of the Green function, which are related 
by s > iw. Further analysis of the underdamped (œ = wa), critically damped (a — 0), and 
overdamped (a = ix) situations is identical to previous results using the Fourier transform. 


5.7.3 Example: Diffusion with Constant Boundary Value 


Suppose that y[x, t] satisfies a diffusion equation 


Py 1dw 

ebay ace 5.272 

Ox? K Ot ( ) 
for x > 0, subject to boundary conditions 

x>0 and r<O=>w=0 (5.273) 

x=0 and t=>0>Y=W (5.274) 


where W is a constant. For example, if a large block of material is brought into contact 
with a hotter block that is maintained at constant temperature, y could represent its tem- 
perature increase. Alternatively, if a block with pure chemical composition is immersed in 
a chemical solution, y could represent the concentration of a foreign chemical diffusing 
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into the block over time. It is useful to perform a Laplace transform with respect to time 


Ix, s] = f ` dte™ yix, t] (5.275) 
0 

yix t] = = | = dse" Wx, s] (5.276) 
2ri c—ioo 


to obtain an ordinary differential equation of the form 


H 
sd, H0s1= 
X K Ss 


where the Laplace constant y[x, 0*] vanishes for x > 0. The Laplace transform of the 
boundary condition y[0, t] = 4¿O[+] becomes WO, s] = a because of the step function 
inherent in the Laplace method. Notice that a Laplace transform with respect to x instead 
would yield an inhomogeneous equation because y[0*, t] = W does not vanish; hence, 
that strategy would not reduce the complexity of the original problem. 

The solution for W[x, s] that satisfies the boundary condition for WO, s] can now be 
written by inspection as 


(5.277) 


Ux, s] = to Exp|-xVs/« | (5.278) 


where we must obviously reject the exponentially growing possibility. The problem now 
is to obtain y[x, t] by inverting the Laplace transform. There is a simple pole at the origin 
and the square root in the exponential is made single-valued by a branch cut along the 
negative real axis. Using the contour sketched in Fig. 5.18, which encloses none of these 
features, we can write 


6 
$ dse" (x, s] = 0 = yx, t] = -Yo > I, (5.279) 
€ 2ni = 
where 
d 
I, = f os et Exp|-x s/k] (5.280) 
c, $ 


represents contributions from each segment of the contour. Segment C, corresponds to 
the Bromwich integral while the other segments serve to establish its value through the 
Cauchy integral theorem. 

The contributions to J, and J, from the left half-plane vanish in the limit R > «o 
because with 


s = Re” => [E Exp| s/k] 
s 


aR! Exp| Rr Cos[6]]| Exp|-xVR/«Cos[$]| (5.281) 


= 1,=1,=90 


the range 5 < 0 < x ensures that the dominant exponential has a negative argument. 


The short horizontal segments in the right half-plane with s = p + loo, where p is real 
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Ims 


C; 


Figure 5.18. Contour for inverse Laplace transform of Exp| -xV s7K |/ S. 


and ds = dp, do not contribute either because there 
. e” —¿p=1 . +in/4 
s=p+iR => — Exp|-x s/k] = FIR Exp|pr + iRt — xV R/ Ke” | (5.282) 
s 
Separating the real and imaginary parts of the argument of the exponential 
St 
ae Exp| -x s/k] ~ FiR”! Exp [or = xVR/2| Exp] +iRt 4 ixVR/2«| (5.283) 
5 


and retaining only the leading dependencies in R 


st 


s =p+iR Exp|-x s/k| = FiR! Exp|-xVR/2x] Exp[+iRt] (5.284) 
S 


we find that the magnitudes of integrands are proportional to R~! Exp[-aR!?] where 
a > 0. The lengths of these segments is finite and the integrands vanish quickly in the 
limit R > oo. 

As C, and C; approach the real axis, C, approaches a complete circle enclosing the 
pole with unit residue in a negative sense, such that 


ly = -2i (5.285) 
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The segments neighboring the real axis take the form 


y a2 | Wl ds _ | Isl | ds 
L= Exp| st — xe ale == Expļ| -st — ix rales 
ki A (5.286) 
2 2 . du 
= —2 Exp[-xtu? — iux] — 
0 u 
at i [sl | ds = o Isl | ds 
I, = Exp] st — xe —|—= Exp] -st + ix, | — | — 
0 pe 20 ¿el (5.287) 
i d 
=2 f Exp[—«tu2 + iux] 
0 u 
such that 
A 2 du Be eis 
I, +1, = 4i os Exp[-xtu*] Sin[ux] (5.288) 
0 
To evaluate the integral 
rs du 2 . 
élx] = a Exp[-xtu*] Sin[ux] (5.289) 
0 
we first consider 
[oe] 1 co 
€ lx] = { du Exp[-xtu?] Cos[ux] = 5 f du Exp[-xtu?] Expliux] (5.290) 
0 -00 
which can be obtained by completing the square 
co 3 x2 00 Ja x 2 
du Exp[-xtu“] Exp[iux] = Ex a] f{ duExp| -| V «tu — i 
{. pl ] Exp[ivx] P-a os | ( E) | 
m x? 
= |^ Exp -— 
kt xp| rr 
(5.291) 


Hence, the desired integral satisfies a differential equation 


vps Je UN > = 
ébl= 5 a Ex- |, 10] =0 (5.292) 


which can be integrated to obtain 


z x y” x/ 2 kt : n x 
= me dx Exp -— | = = dy = — Erf Be, 
ék] mh” xp] | Vn i e” dy=> r|- -| (5.293) 


Therefore, we finally obtain 


yix, 1] = voErte| | (5.294) 


where Erfc is the complementary error function. 
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0 1 2 3 4 5 


q=x/V xt 


Figure 5.19. Diffusion with constant boundary value. 


Notice that the natural time scale is determined by x7! while the natural distance scale 
is determined by V«t such that the dimensionless variable y = x/Vxt provides a sim- 
ple one-dimensional function describing both the distance and time dependence of the 
approach to equilibrium. Figure 5.19 illustrates the equilibration of the sample in terms of 
its natural dimensionless variable. This solution is characteristic of any diffusion problem 
(temperature, chemical concentration, etc.) for which a plane is maintained at a constant 
boundary value. 

It is useful to examine the behavior of this function for both small and large values of 
its dimensionless variables using series expansions. For a fixed position, the initial time 
variation is determined by the solution for large 7 while its asymptotic approach to equi- 
librium is determined by the solution for small 77. 


Series [Er £c [2] , tn, 0, 23] 


n 3 
1- — +0 
J dd 


(Series [Er fc [x] , {x, », 4+] / .{x > 9/23) / / Normal / / Simpli fy 


12 

2e 7 (-2+m?) 

Van? 
Perhaps it is also useful to visualize the equilibration process using the three-dimensional 
Fig. 5.20. The initial temperature elevation at x = 0 spreads gradually to more distant areas 
and later approaches the final temperature asymptotically. 

Integral transforms are so useful that extensive tables of Fourier and Laplace trans- 
forms have been compiled. Therefore, once one is comfortable with the basic methodol- 
ogy, one can often obtain the necessary transforms either from standard compilations or 
from MATHEMATICA or another symbolic math program. For example, MATHEMATICA can 
provide the inverse transform 


InverseLaplaceTrans form [“eexpt-xV s/x],s, +] /1 
s 


FullSimpli fy[#, {t >0,x>0,x>0}]& 
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Figure 5.20. Temperature equilibration (arbitrary units). 


needed for this problem without the pain of performing some rather tedious integrals by 


hand. 


Problems for Chapter 5 


Moments of form factors 
The charge density p,[r] in an atomic nucleus can be measured using electron scattering. 


1. 


Analysis of the angular distribution for electron scattering provides the form factor, which 


is related to the density by the Bessel transform 


(5.295) 


Sin[x] 
X 


An f drr’ jolgrle. lr] jod 


[q] = 


Pen 


where r is the radial coordinate, q is the momentum transfer, and j, is the spherical Bessel 
function of order zero. It is often useful to characterize distributions in terms of moments 


(5.296) 


M, lp] = 47 f drr’™™" plr] 
0 


and the mean-square radius 


(5.297) 


M 


(°) 


Finally, the nuclear charge density results from a convolution of the proton density p (in 


2 
0 


M 


within 


other words, the distribution of protons with the nucleus) and the charge density p, 


a proton such that 


(5.298) 


ACI 


=p 


Pon =P SP, = Pall 
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a) Show that a form factor can be expanded in terms of 


o0 


plal =>) a,M,,q” (5.299) 
n=0 


and determine the coefficients. How can one extract M,, using derivatives with respect 
to g°? How does (r?) enter a low-g expansion? 
b) Determine the general relationship between M),lp.; 1, M>,[e], and M,,[p,]. Find 


explicit formulas for n = 0, 1. In particular, how are the mean-square radii related? 


2. Elastic form factor for 160 
In the harmonic oscillator model, the radial wave functions for the nuclear 1s and 1p 
orbitals take the shapes 


r2 r2 
| R, [7] oc rExp|-35| (5.300) 


where bis the oscillator constant. The radial wave functions should be normalized so that 


R,,[r] oc Exp - 


f drr|R, Ir? = 1 (5.301) 
0 


The proton density for !6O, containing 2 protons in the 1s orbital and 6 protons in the 1p 
orbital, is then 


2R, [rÊ + 6R LP? 


plr] (5.302) 
4r 
The elastic form factor is given by 
3 PD Ae ot ; Sin[x] 
Pla] = 47 f drr” jolagrlelr] jol] = (5.303) 
0 


where q is the momentum transfer. Compute the elastic form factor and produce a semilog 
plot of Ila? given b = 1.8 fm. The momentum transfer q should be measured in units 
of fm”! where 1 femtometer (fm) corresponds to 107'* m, which is the appropriate length 
scale for nuclear physics. Also, determine the rms radius by examining the power series 
for p[q] for small q. 


3. Fourier transform of periodic function 
Suppose that f[t] = flt +T] is strictly periodic with period T. Evaluate the Fourier trans- 
form, flo. 


4. Array theorem 
Suppose that 


N 
fll = >" eix- xl (5.304) 


n=1 


consists of an array of identical distributions about a set of positions {x, }. 


Problems for Chapter 5 183 


a) Evaluate the Fourier transform f[k] by expressing the summation as a convolution 
with a set of point sources represented by delta functions. 

b) Perform the summation for a uniform array x, = na, n = 1, N and evaluate the inten- 
sity |F. 


5. Fraunhofer diffraction grating 
The Fraunhofer approximation to diffraction takes the form 


[oe] 2 
S[6] = | f A[x] Expligx] dx (5.305) 


where A[x] is an aperture function that takes unit values for open regions and vanishes at 
obstacles and where q = k Sin[@] for wave number k = 27/1. The aperture function for a 
grating with N identical slits takes the form 


N 
Alx] = $ @[b = 2lx - nal] (5.306) 
n=1 
where a is the spacing and b < ais the width. Use the convolution theorem to evaluate the 
diffraction pattern efficiently. 


6. Convolution of a Breit-Wigner resonance with a Gaussian resolution function 
Suppose that the energy dependence of the differential cross section for a nuclear reaction 
is described by the Breit-Wigner profile 


(5.307) 


where T is the full-width at half-maximum and Sp is the integral over energy. However, the 
measurement of energy is smeared by a Gaussian resolution function 


Rie] = (5.308) 


1 j w? 
Vro? x| A 
such that the observed distribution is the convolution Y = S Y R. Use the convolution 
theorem to obtain an explicit formula for Y [w]; your result will probably involve the com- 
plementary error function. If you have access to suitable mathematical software, compare 
your symbolic result with numerical evaluation of the convolution integral. Also compare 
Y[w] with S[w]. (Note that MATHEMATICA 5.1 cannot evaluate the convolution integral 
symbolically.) 


7. Loaded beam 
The displacement of an infinitely long beam on an elastic foundation is described by 


d’ 
(5 + JES == P[x] (5.309) 


where we assume that y[+co] = 0. Use the Fourier transform method to compute the 
Green function for this system. Then provide an integral which can be used to compute 
the displacement given an arbitrary P[x]. 
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8. Diffusion and radiation of heat 
Heat diffuses in a medium that also radiates heat at a rate proportional to temperature. 
Thus, a one-dimensional spatial distribution of temperature T[x, t] satisfies 


OF eine car (5.310) 


where D and @ are positive constants. A finite amount of heat is released at a point in an 
infinite bar such that 


T[x, 0] = Qo[x] (5.311) 


a) Evaluate and describe the temperature distribution at later times. 
b) Compute the total amount of heat lost to radiation between time zero and time ¢ and 
interpret the result. 


9. Damped oscillator subject to a step function 
Use the Fourier transform to solve 


(5 + a + 03) = fl], t<0= x1t] =x' [4] =0 (5.312) 
when 
Fla] = ore] (5.313) 


is a step function that “turns on” at t = 0. Find explicit solutions for the underdamped, 
overdamped, and critically damped cases. Compare these three solutions graphically and 
explain their behavior. 


10. Damped oscillator subject to square pulse 
Use the Fourier transform to solve 


a ð 2 
ae + Y + wo | xit] == fit] (5.314) 
when 
FEI = (OI: +7] - Olt — 7)) (5.315) 


is a square pulse. Plot the solutions for y/w, = 0.5, 1.0, 2.0 with 7 = 5/w, as functions 
of Wot and explain their general characteristics. If this were an RLC circuit, under what 
conditions would the output follow the input most closely? 


11. Orthogonality relations for Fourier sine and cosine transforms 


Derive the orthogonality relations: 


{ dt Cos[wt] Cos[w’t] = Sólo - w] + [lw + w]) (5.316) 
0 


T dt Sin[wt] Sin[w’t] = 5 (le - w'] -6[w + w']) (5.317) 
0 
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12. Relationship between Fourier sine, cosine, and exponential transforms 
Demonstrate the equivalence between the representations 


m= f * doe He] = 2 f > do(JE lo] Cos[wr] + 8 [wl Sin[wr}) 
27 Js T Jo 


(5.318) 
where 
FON = 5011 fD (5.319) 
and 
flo] = f are” fir] (5.320) 
fe lo] = f © dt Costo FON (5.321) 
Btw = [a Sinton fOr (5.322) 
Finally, demonstrate that 
Flo] =2(70 lo] + if lol] (5.323) 
13. Green functions using Fourier cosine or sine transforms 
The Fourier cosine transform is defined by 
mua = Fef = f fi Costes) ds (5.324) 
a) Use the Fourier cosine transform to solve 
y"[x] - ay] = 0 y[0]=>b,  yloo]==0 (5.325) 


b) Next use the Fourier cosine transform to obtain the Green function that satisfies 
GU [x, x] — a’G, [x x]=0lx-xw] Gi[0,x’] == 0, G,[00, x’] == (5.326) 
and write a formal solution for the inhomogeneous equation 
y" [x] - o*y[x] = fix] yl] =b, yloo] =0 (5.327) 


c) Now suppose that the lower boundary conditions are replaced by y[0] == a for the 
homogeneous problem and by G,[0,x"] == O for the Green function. How do the 
solutions change? Why is the Laplace transform inconvenient for this problem? 
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14. Born approximation for Yukawa potential 
The Born approximation for elastic scattering of distinguishable particles with interaction 
potential V approximates the differential cross-section by |M|? where the matrix element 
takes the form 

Mss f © [FV TPID; [P] ar (5.328) 

27 

with y being the reduced mass and 7 = 7, — 7, the separation vector. The wave functions 
are approximated by plane waves of the form 


©, = Explik,-7], ©, = Explik, -7] (5.329) 
where the wave vectors are equal in magnitude, such that k, = k;. Evaluate M for the 


central part of the Yukawa interation between two nucleons, given by 


eT" 
Vin) = Yo m,r 


T 


(5.330) 


where m, is the pion mass, Vo is a strength constant, and r = [r, — 7,1. Express your result 
both in terms of the momentum transfer g = k; -= k; and in terms of the scattering angle 6. 
(Note: we are neglecting the spin dependence and tensor properties of the nucleon—nucleon 
interaction.) 


15. Numerical convolution using FFT 
Use the FFT function in your favorite mathematical software to reproduce the figures in 
the text relating to the convolution 


x(t] = f j G[t - t]F[t] dt (5.331) 
where 
G[t] = wp Expl-yt] Sin[wpt]O[t], wg = y Wi -y (5.332) 


We used wọ = 0.3 and y = 0.05 in natural units, N = 256, and 


t—sy 
Fit]=(-sy Expl- >] (5.333) 
w 
with s = 150 and w = 200. Show that a judicious choice of phase shift suppresses artifacts 
due to wrap-around in the working array. 


16. Temporal correlation 
Use the FFT function in your favorite mathematical software to reproduce the figures in 
the text relating to the autocorrelation function for the logistic map. 


17. Van der Pol spectrum 

Again assuming that mathematical software with both a differential-equation solver and 
FFT is available, reproduce the figures in the text that illustrate the power spectrum for a 
van der Pol oscillator. 
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18. Windowing 

The spreading of the power spectrum for a discrete sinusoid is similar to that of a finite 
wave train using a continuous Fourier transform. In both cases one is effectively multiply- 
ing a persistent function by a time window of the form O[tJO[T — t] that has sharp edges 
and reproduction of a rapid temporal transition requires Fourier components with arbi- 
trarily high frequency. Therefore, the window function artificially enhances the power for 
large frequencies. More realistic estimates of the power spectrum for the input signal can 
be obtained by using a window function, w[f], with softer features that do not distort high 
frequencies as badly. For the purposes of discrete Fourier transform using sampled data 
within a range O < t < T, the window function should vanish for t < 0 and t = T, should 
be relatively flat over a wide central region surrounding a maximum value of unity at 7/2, 
and should be symmetric about 7/2. Many window functions with these properties can 
be found in the literature and their proponents often argue that their version optimizes a 
particularly important figure of merit, such as equivalent noise bandwidth, scallop loss, or 
other esoterica. The nonspecialist may as well use the simple Welch window 

=N=1P 

ET) 

with parabolic shape. The purpose of this problem is to demonstrate numerically, with 
two simple examples, that use of an appropriate window function provides qualitative 


improvements in the power spectrum. More detailed analysis of the performance of various 
window functions may be found in specialized literature. 


w =1- (5.334) 


J 


a) Show that the Welch window has the desired temporal characteristics and then display 
its discrete Fourier transform using N = 256. 

b) Compare power spectra for f[t]O[t]O[T — t] and w[t] f[t] where f[t] = Sin[0.3£] and 
comment on the effect of using a softer window. 

c) Apply the Welch window to improve the estimated power spectrum for the van der 
Pol oscillator with e = 10. We suggest 4096 samples within 200 < £ < 600. 


19. Laplace transform of J¿[x] using an integral representation 
a) Demonstrate that the integral representation 
1 T 
J= = f d8 Cos|z Cos[8]] (5.335) 
0 
provides a solution to Bessel’s equation 
z f(z] + zf [zl + (z? - y?) fiz] == (5.336) 
for v = 0 and initial condition f[0] = 1. 
b) Use this integral representation to determine the Laplace transform of Jọ [x]. 


20. Laplace transform of J [x] using its differential equation 
The Bessel function J¿[x] satisfies the differential equation 


Ë 1d 
(Je +" + vota =0, Jo[0]=1 (5.337) 
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a) Show that the Laplace transform do [s] satisfies a first-order differential equation and 
obtain Jls]. (Hint: if f[x] is continuous at x = 0, you can use Le eXflxdx = 
s~! f[0] when s > co.) 


b) Use an expansion of J [s] in powers of s™! to develop a power series for Jo[x]. What 
is the radius of convergence? 


21. Decay chain 
Suppose that a radioactive nucleus decays sequentially such that the populations of the 
first three members of the decay chain evolve according to the rate equations 


dN 

a =-A/N, (5.338) 
dN. 

ae == A,N, - AN, (5.339) 
dN 

FA = A,N, — AN, (5.340) 


Use the Laplace transform method to determine N,[t] assuming initial populations N,[0] = 
ni. How can we extend the procedure to longer sequences? 


22. RC circuit using Laplace transform 
In the text we demonstrated that the current in a simple RC circuit with a variable voltage 
source V[t] satisfying 


RI[t] + zÍ It']dt =V[t], JI[t <0] =0 (5.341) 
0 


can be expressed as 


5 s z 
I= .342 
1+sT GR 


or 
Ri{t] = Vit] -r te~” { Exp[t’/t]V[t’] dt’ (5.343) 
0 


where 7 = RC is the characteristic time constant for this circuit. For each of the following 
V[t] profiles, demonstrate that both the inverse Laplace transform and the convolution 
integral give the same current /[f]. 


a) Step function: V[t] = V,©[t — a] with a > 0. 
b) Rectangular pulse: V[¢] = V)(O[t — a] — O[t — b]) with b > a > 0. 


c) Sinusoidal voltage: V[t] = V, Sin[wr]@[r]. Identify the transient component and eval- 
uate the phase shift between current and voltage oscillations after the transient has 
decayed away. 
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23. Current in RLC circuit using Laplace transform 
The current in a series RLC circuit with a variable voltage source V[t] satisfies 


jaa +RI+ : f 1[11] dt’ == V[t] (5.344) 
dt C Jo > l 


where we assume that the circuit is quiescent for £ < 0 (no current and discharged capaci- 
tor). Although this equation is usually converted into a differential equation for the stored 
charge, it is instructive to solve the problem as an integro-differential equation instead. 


a) Express the relationship between Laplace transforms 7 and Y for current and voltage 
as a convolution and determine the Green functions for (i) underdamped, (ii) over- 
damped, and (iii) critically damped circuits. 

b) Use the Laplace transform method to evaluate /[t] when the voltage is constant dur- 
ing a <t < b. Sketch the behavior of underdamped and overdamped circuits. 


c) Use the Laplace transform method to evaluate the current for a sinusoidal voltage 
switched on for £ > 0. Identify both the transient component and the phase-shift 
between current and voltage oscillations. 


24. Laplace transform of periodic functions 
Consider a periodic function of the form f[t + T] = fit]. 


a) Show that the Laplace transform is given by 


F 
fis] = = f file" dt (5.345) 
l-e 0 


What happens in the limit T > co? 
b) Verify that the expected results are obtained for Sin[t] or Cos[f]. 
c) Evaluate Laplace transforms of the square wave. 
d) Evaluate the Laplace transform of the sawtooth wave, f[t] = Modit, T]. 


25. Rectifiers 

An ideal half-wave rectifier circuit passes positive and blocks negative voltage signals 
while the output of an ideal full-wave rectifier is the absolute value of the input voltage. 
Evaluate the Laplace transform of the output for both half-wave and full-wave rectification 
of a sinusoidal input voltage, Sin[wr]. 


26. Initial-value problems using Laplace transform 
Solve the following differential equations, with specific initial conditions, using the Laplace 
transform. 


d* 
a) CA —Dy[t]=t, y[0] = y'[0] = y”[0] = y’”[0] = 0 (5.346) 
b) y [r] + { y[t]dt==e", y[0]=1 (5.347) 
0 


c) y”lx] +y [x] + ylx]=0, y[0]=1, y'[0] =0 (5.348) 
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27. Difference-differential equation using Laplace transform 
Suppose that y[f] satisfies a difference-differential equation 


t > 0 = y'[t] + yit] — yit - 1] = 0 (5.349) 
with the initial condition 

-1 < t < 0 = y(t] = yolt] (5.350) 
where yọf[ź] is a known function. 


a) Show that the Laplace transform satisfies 
0 

yis] = (1 +s-e y”? [vio + e f e™ yit] ar) (5.351) 
-1 


b) Compute y[1] for t > O given yo[t] = 1 and check the result. 


28. Abel’s integral equation 
Abel’s integral equation for the unknown function y[t], with £ > 0, is 


f ylr]t -— 1)“ dr = f[t] (5.352) 
0 


where @ is a constant in the range 0 < œ < 1 and f[t] is a known function. 


a) Use the convolution theorem for Laplace transforms to obtain a formal solution for y[¢]. 
b) Evaluate y[t] given f[t] = 1 and check the result. 


29. Wave equation with traveling disturbance 
Consider an inhomogeneous wave equation of the form 


8 1 0? 
ôl or 


Jus t] == Ó[x — vt] (5.353) 


fort > 0in0 < x < oo subject to the boundary and initial conditions 


Opa, 0] 
ðt 


WO, t]=0, yix, 0] == =0 (5.354) 


Use a Laplace transform with respect to £ to solve this equation for v < c, v = c, and v > c. 
Describe and compare these scenarios. 


6 Analytic Continuation 
and Dispersion Relations 


Abstract. If two representations of analytic functions are equivalent in the overlap- 
ping regions of their domains, they represent the same function. Analytic continua- 
tion provides a systematic means for extending the definition of an analytic function 
into broader domains of the complex plane. Combined with physical requirements 
of causality, this method can be used to relate the real and imaginary parts for func- 
tions that describe the response of a physical system to an external perturbation. 
We use this method to analyze the relationship between refraction and absorption in 
optical media and dispersion relations for wave propagation. We also introduce the 
phenomenon of solitons in systems with nonlinear dispersion. 


6.1 Analytic Continuation 
6.1.1 Motivation 


Often one develops a representation of an analytic function f[z] in terms of a series expan- 
sion or an integral that converges in a finite domain D, and would like to extend that 
domain into a broader region of the complex plane. To illustrate the idea, we consider a 
very simple example. The function f[z] = (1 — z)7! can be expanded about any finite z, 
using a Taylor series 


co yk 
fiz] - Y En (6.1) 


il 
tao (1 = Zn)" 


that will converge in the domain D,,: |z — z,| < |1 — z,|. Although we can easily derive the 
series for this simple function, in a more complicated problem we might have developed 
the series representation near a particular z, without knowledge of the underlying analytic 
function that it represents. Thus, if we had deduced the series 


All = >°z, (6.2) 
k=0 
we could then demonstrate that f, converges to f[z] = (1 — z)! within the domain 


D,: ll < 1. Similarly, by considering the region around z) = i we might deduce a sec- 
ond series 


2 Nk 
fd = y] E (6.3) 


KH 
m 7D" 
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Figure 6.1. Analytic continuation using Taylor series. Here a Taylor series around z = 0 converges 
in domain D,, while another series around z = i converges in domain D,. The heavy dot represents a 
singularity that limits the convergence radii for both domains. If these series are equal in the overlap 
D, N D,, they are elements of the same analytic function in a larger domain that includes D, U D,. 


and then demonstrate that f, converges to the same f[z] = (1 — z)! within the domain 
Dy: |\z—-i| < V2. Although these series look different, they represent the same function 
within the overlap region D, N D,. We describe f, as an analytic continuation of f, from 
domain D, into domain D, and identify both f, and f, as elements of the same analytic 
function f[z]. Although f, and f, converge and hence are analytic only within their respec- 
tive domains D, and D,, the underlying function f is analytic in a domain that is at least 
as large as the union D, U D,. Therefore, the underlying analytic function is more general 
than any of its representations (elements). 

Generally one attempts to perform a sequence of analytic continuations which extends 
the domain to the largest possible region of the complex plane, but in practice this process 
is often limited by singularities or branch points. Taylor series often provide the simplest 
method, where the radius of various overlapping disks is limited only by the distance to 
the nearest singularity, but other methods are sometimes more efficient. 


6.1.2 Uniqueness 


Suppose that f,[z] and f,[z] are analytic in domains D, and D,. One can show that if 
D, AD, contains more than one point it contains an infinite number of points and also 
constitutes a domain. Thus, the function g[z] = filz] — f,[z] is analytic in D, A D,. Now 
suppose that g[z] = 0 on an arc or within a subdomain that lies within D, ND,. Recognizing 
that the zeros of an analytic function are isolated (recall Sec. 1.12.5) unless the function 
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Figure 6.2. Analytic continuation using a sequence of domains D, > D, > D, > D, that surrounds 
a singularity. 


vanishes identically, we conclude that g[z] = 0 throughout D, N D,, such that f,[z] = fiz] 

throughout D, N D,. Hence, any pair of functions f; and f, that are equal on an arc or 

a subdomain within their common domain of analyticity D, N D, represents the same 

analytic function everywhere throughout D, A D,. Therefore, a function that is analytic in 

domain D is uniquely determined by its values on any arc or domain contained within D. 
Reconsider the function 


fla=> a (6.4) 
k=0 


which is analytic within |z| < 1 where it converges to f[z] = (1 — z)7*. According to the 
uniqueness theorem, f[z] is the only function that is analytic in the entire complex plane, 
except for z = 1, and coincides with f, in their common domain |z| < 1. Therefore, f is 
the unique analytic continuation of f, into the entire complex plane. 

Now suppose that we attempt to extend a function by analytic continuation from D, 
into D, into D}, etc., carefully avoiding singularities in an attempt to cover the largest 
possible region of the complex plane. A schematic set of overlapping domains is sketched 
in Fig. 6.2 representing the sequence of continuations fi > fh > f > fa Will f,[z] == 
filz] in D, ND,? Maybe, but maybe not — the uniqueness theorem provides no guidance on 
this question because there is no overlap between D, and D, or between D, and D, in the 
sequence sketched. Often a singularity or branch point in the nonoverlapping region will 
spoil the closure of such a sequence. Perhaps it will help to consider a specific example, 
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even if it is somewhat artificial. The sequence of functions 


filz] = vrei” 0<0,<x (6.5) 
fiz] = Vre? 5 <0, < - (6.6) 
hel] = vre? <0; < 2r (6.7) 
filiz] = Vre? = <0,< Z (6.8) 


all represent branches of yz continued from one half-plane into another that overlaps in 
one quadrant. Thus, f, is a continuation of f, into the third quadrant with f, = f, in the 
second quadrant, f} continues into the fourth quadrant with f} = f, in the third, and f} 
returns to the first quadrant with f} = f, in the fourth. However, even though f, and f} are 
both defined in the first quadrant, they are not equal: 


fylzl =-f [2] inD, ND, (6.9) 


Here analytic continuation fails to close because the path encloses a branch point of yz. 
Note that both D, N D, and D, N D, contain only one point, the branch point, which does 
not constitute a domain. 

The theorem that an analytic function is completely determined throughout its domain 
of analyticity by its values on an arc appears to be extremely powerful. If physics argu- 
ments require a function to be analytic, it might appear that we could construct the entire 
function by measuring on such an arc, which is surely easier than measuring it every- 
where in a domain. It is almost like cloning your mother from a hair follicle! However, 
mathematical and physical standards of knowledge are different. To apply this theorem 
we would have to make measurements that are perfectly accurate, which is not possible. 
In practice, the errors in the reconstruction process would grow quickly as the distance 
from the measured region increases. The larger the arc the better the convergence of the 
reconstruction is likely to be, but analysis of the accuracy of such a procedure requires 
considerable sophistication. Nevertheless, in many fields one can make much progress by 
measuring the function over as large an arc as possible, typically as much of the real axis 
as possible, and constraining the asymptotic behavior using physical principles. Thus, the 
theory of analytic functions finds widespread application in nuclear, particle, condensed 
matter, and other fields. A few basic examples will be studied later in this chapter. 


6.1.3 Reflection Principle 


Suppose that f[z] is analytic in a domain D that includes a segment of the real axis and 
that f[x] is real on that segment, such that f[x] = f[x]*. Using the Cauchy—Riemann 
equations, one can show that if f[z] is analytic in a domain that includes both z and z*, 
then so is f[z*]*. Hence, because the two analytic functions f[z*]* and f[z] are equal on 
an arc within D, they are equal throughout the portion of D that is symmetric with respect 
to the real axis. This result is known as the Schwarz reflection principle. 
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6.1.4 Permanence of Algebraic Form 


Suppose that a function f[x] is defined on a portion of the real axis. If we then define f[z] 
by replacing the real variable by a complex variable, x => z, the function f[z] uniquely 
determines the analytic function that has the same values on that portion of the real axis 
throughout any domain that contains it; there can be no other distinct analytic continuation 
from the real axis into the complex plane. For example, if f[x] is represented by a power 
series that converges on some portion of the real axis, the same series with x > z represents 
an analytic function that converges in domains containing that portion of the real axis. We 
implicitly used that property when deriving the Euler identity by replacing x by ¿0 in the 
Taylor series for e* (recall Sec. 1.1.3). In fact, e* is the only entire function which reduces 
to e* on the real axis. More generally, if f[x] and g[x] satisfy an algebraic relationship 
of the form F [ flx], elx]| == 0, then the analytic continuations f[z] and g[z] satisfy an 
algebraic relationship F| f[z], gizl] == 0 throughout their common domain of analyticity. 
Furthermore, if F [ fill, 2, [zl] == 0 throughout domain D,, where F is an analytic function 
of its arguments, then F [HE golz]] == 0 in domain D, where f, and g, are analytic 
continuations of f, and g, from D, into D,. This property is described as permanence of 
algebraic form. Often it is simpler to prove a relationship using one representation than 
another, but for analytic functions we can be confident that the same relationship applies 
to all representations within their domains of analyticity. 


6.1.5 Example: Gamma Function 


The gamma function for positive real variables x is defined by the integral 
T[x] = f tle'tdt forx>0 (6.10) 
0 


and reduces to the factorial P[n] = (n — 1)! for positive integers. Note that we must restrict 
the range of x to ensure convergence of this integral representation. Continuing this func- 
tion into the complex plane, we attempt to define the analytic function T'[z] as 


TE] = f Ele” dt for Relz] > 0 (6.11) 
0 


This definition clearly agrees with the first on the real axis, but we must prove that the 
extended definition is analytic in a domain that includes the positive real axis. For this 
we appeal to Morera’s theorem (Sec. 1.10.5) which states that if $- f(z] dz = 0 for every 
simple closed contour C in domain D, then f[z] is analytic in D. First, we prove that T[z] 
converges absolutely by evaluating 


[T[z]| = if te" dt 
0 


where ¢ is real and z = x + iy. Next we express contour integrals in the form 


rel dz= A let dt)dz = f ($ pl deje” dt =0 (6.13) 


< f Ple” dt = |[Plx+ iyl| < Tix] (6.12) 
0 
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where absolute convergence justifies exchanging the order of integrations. Recognizing 
that 177! is analytic, the innermost integral vanishes and proves that T[z] is analytic in the 
domain Re[z] > 0. 

Using integration by parts, 


Tlz +1] = i ře“ dt = (Be) + ef tle” dt (6.14) 
0 0 
one easily obtains a recursion relation 
Plz + 1] = zz] (6.15) 


that can be used to continue I into the region with Re[z] < 0. First, 


_The+ 1 
E Z 


T[z] 


— 1 < Re[z] <0, z#0,-l (6.16) 


provides a definition that now extends in the domain Re[z] = —1 excluding z = 0, 1. Then, 


_ Tlz+2] 
riz] = 2(z + 1) 


-2 < Reiz] < -1, 2+-1,-2 (6.17) 


extends the domain a little further. By repeating this process indefinitely, we obtain an 
analytic continuation 


_ Tk +n+1] 


fase 
lz] To (Z + k) 


with n < —Relz] <n+1 (6.18) 
that extends the definition of P'[z] into the entire complex plane excluding nonpositive 
integers. Although this might not be the most convenient representation, the uniqueness 
theorem ensures that any analytic function that reproduces T [z] for positive Re[z] will pro- 
duce the same values in its range of analyticity as produced by the representation above, 
and hence is really the same function whatever its superficial appearance might be. We 
will study the gamma function in more detail later, developing a wider variety of repre- 
sentations and relationships, but the present analysis suffices to demonstrate the power of 
analytic continuation. 


6.2 Dispersion Relations 
6.2.1 Causality 


Often there are physical arguments, such as causality, that justify extension of a function 
of a real variable into the complex plane as an analytic function of the complex variable 
whose real part is experimentally accessible. For example, the propagation of an electro- 
magnetic wave through a homogeneous isotropic medium is represented by a complex 
refractive index of the form A = n + ia where n[w] and a[w] are real functions of fre- 
quency w that govern the phase velocity and absorption of the wave. These properties 
depend upon the electromagnetic properties of the constituents and the collective response 
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of the system to an electromagnetic field. The dependence of phase velocity upon fre- 
quency is known as dispersion. Using quite general arguments based upon causality, one 
argues that the combined function Afw] = n[w] + ia[w] is an analytic function of com- 
plex frequency whose real and imaginary components reduce to the refractive index and 
absorption coefficient on the physical (real) axis. The theory of analytic functions then 
provides some very powerful relationships between the dispersive (refractive) properties 
and the absorptive properties of the system, known as dispersion relations. Dispersion 
relations were first derived by Kramers and Kronig for electromagnetic theory but have 
since found widespread application in many areas of physics, including nuclear, particle, 
and condensed matter physics. Rather than develop the theory of dispersion relations for 
arbitrary analytic functions, we illustrate the general approach using the specific case of 
electromagnetic waves in a dielectric medium. 

Consider a plane wave of the form Exp|i(kx - wt)| where the wave number is given by 


0 
k=n— 
Cc 


w 0 

= —y pe = (n + ia)— (6.19) 
c Cc 

where 7i[w] is the complex refractive index and e[w] is the dielectric permittivity. For 

simplicity we will assume that the medium is nonconducting and nonmagnetic (u = 1), 


such that fi? = e. Identifying the real and imaginary parts of ñ[w] as n[w] and a[w], the 
plane wave 


@ œ Exp|i(kx — wt)| = Exp|i(nx — ct)w/c| Exp[-ax/2] = Ip? œ e (6.20) 


propagates with phase velocity c/n but is absorbed with attenuation length a~! = c/2wa 
(reduced in intensity by the factor e7! in distance a~!). In Gaussian units the dielectric 
permittivity is related to the electric susceptibility y by 


e=1+4zy (6.21) 


where the polarization of the medium is related to the electric field by P = yE. 
Next consider the electric displacement D[w] = e[w]E[w] for a wave packet 


D{t] = l f > dwe ™ D[w] = t i > dwe EwlE[w] (6.22) 
27 Jo, DIC 2) 00 
where 
E[w] = f > dte E[t] (6.23) 


Assuming that the integrals converge and interchanging the order of integrations, we obtain 


1 ij a ES e 
mal doere | dt'e Eft’] 
277 —00 =o: 


1 [oe] co . 
zS drtE[t — a f dwe" elw] 
27 -0o —0co 


Dit] 


(6.24) 
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where T = t — t’. Then substituting e > 1 + 47y, we find 


co 


D{t] = + f 7 dtE{t - e][2róte + 47 f due" yg] (6.25) 


or 
D[t] = E[t] + T dr Elt — t]G[r] (6.26) 


where the first term is the incident plane wave in the absence of the medium, while the 
second term arises from the contribution of charges in the medium governed by a response 
function of the form 


G[t] = L f dw e (e[0] — 1) (6.27) 
27 Jo 
Physically we expect that the response of the system at time £ can only depend upon fields 
at earlier times 1” < t => T > 0. This notion that effects must follow their causes is known 
as the causality principle. Therefore, we expect G[7] to vanish for 7 < 0 independent of 
any specific properties of the system and can write 


Dit] = E[t] + qe dtE[t - T]G[T] (6.28) 
0 


Equations of this form are typical expressions of the delayed response of a linear system to 
a driving force. If the damping is weak the present amplitude may be amplified by coherent 
contributions from many earlier vibration periods. 

We also expect that w > œ = y > 0 = e > 1 for extremely high frequencies 
where the electromagnetic field oscillates too rapidly for any charged particles with inertia 
to respond. Under these conditions the integral for the response function converges for real 
values of w. Suppose that we allow w to be a complex variable and define 


1 —iwT 
Git] => $ e™ (ew]- 1)dw (6.29) 


where the contour C includes the real axis. We can close the contour for 7 < O with a great 
semicircle around the upper half-plane, as sketched below, such that 


E hae 
Gir] = oe f f e™ ew] - 1)dw 
+iR f Exp|TR Sin[6]] Exp[ -¿R7 Cos[0]](e[Re*] - 1)d@ (6.30) 
0 
can be expressed in terms of a contribution from the real axis that approaches G[7] in 


the limit R > oo and a contribution from the great semicircle that vanishes for nega- 
tive 7 because e is bounded and the integrand includes an exponential whose argument 
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Figure 6.3. Contour for G[t < 0]. Two poles of e[w] are indicated by heavy dots. Causality confines 
singularities of e to the lower half-plane such that G[t < 0] = 0. 


approaches —oo. Similarly, for positive r we close C around the lower half-plane and again 
find that GIT] > G[r], such that 


1 . 
G[r] = — $ e““"(e[w] - 1)dw (6.31) 
27 C 
However, the causality principle 
1 
retort $ e“"(e[w] — 1)dw = 0 (6.32) 
Cc 


requires G[7] to vanish for 7 < 0. Therefore, according to the Cauchy integral theorem, we 
conclude that e[w] is analytic within the upper half of the complex w plane. Any singular- 
ities, such as poles or branch cuts, are confined to the lower half-plane and are responsible 
for the delayed response of the system to an external influence. The two black dots in 
Fig. 6.3 represent possible poles in the lower half-plane. 

The convolution formula for D[t] can be obtained more easily using physical reason- 
ing than by the derivation above. Recognizing that e[w] does not have a simple Fourier 
transform because it approaches unity rather than zero for infinite frequencies, we simply 
add and subtract that offset to write 


D[w] = e[w]£[w] = Elw] + (e[w] — 1IE[w] (6.33) 


where the function (€[w] — 1) is analytic in the upper half-plane and is suitable for Fourier 
transformation because it does approach zero for w => oo. Then applying the convolution 
theorem to the product of Fourier transforms, we can write 


Dit] = E[t] + T dTE[t-—T]G[T] (6.34) 
0 
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where 
Lf E 
GIT] = 5> f dw e(ew] — 1) (6.35) 
277 J_ 


00 


practically by inspection. 


6.2.2 Oscillator Model 


Let us illustrate this argument with a toy model. Consider a particle of mass m and charge q 
that is attached to an atom by a spring (linear restoring force) with natural frequency wy 
and which experiences an external electric field E,,e7*”. We also assume that the motion 
of the particle is damped, either by interactions with the surrounding medium or by its own 
radiation of energy. The equation of motion for a driven damped oscillator takes the form 


ï+ yk + wx = hie (6.36) 
Using 
E 
{if =x,e = x, = 2 v (6.37) 


the dipole moment becomes 


g E Po _ T 1 
= u. => (Y = w = 6.38 
Po= m w — 2iyw — w? E, mw, -2iyw-w* — 


where « is the single-particle polarizability. If we have a dilute system with N oscillators 
per unit volume, such that y = Na, the dielectric permittivity becomes 


2 2 
e=1+4IN0= 14 32, o =ArNL (6.39) 
wg — 2iyw — w m 


where w, is known as the plasma frequency. According to this oscillator model, € is an 
analytic function except for two simple poles in the lower half-plane located at 


(6.40) 


y <0 => w, x —iytw RR Fz (6.41) 
Notice that damping requires y > 0. Hence, the absence of spontaneously growing solu- 
tions confines the poles to the lower half-plane and is closely related to the property of 
causality in physics. The dielectric permittivity for a single mode is sketched in Fig. 6.4. If 
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Figure 6.4. Dielectric permittivity for a single mode modeled as a weakly damped oscillator. 


the damping is weak, absorption is confined to a narrow peak around the resonant fre- 
quency wọ. The refractive index is slightly greater (smaller) than unity below (above) 
the peak. Normal dispersion with 0n/0w > 0 occurs over most of the frequency range, 
but in the immediate vicinity of the resonant frequency there is anomalous dispersion 
with 0n/dw < 0. 

To evaluate G[t] for 7 > 0, we must close the contour in the lower half-plane, as shown 
in Fig. 6.5, to ensure that the contribution of the great semicircle vanishes. Thus, the value 
of the integral becomes —277 times the sum of the residues, where the negative sign arises 
because the contour is traversed in a negative (clockwise) sense. 


1 : Ñ 
G[r] = > perra - 1)dw = -i(R,e +" +4+R_e 5) forr>0 (6.42) 
E 
Therefore, we obtain the response 


Gir) = oger er 


ofr] (6.43) 
0 


where the Heaviside step function 


O[r]=1 forr>0 


(6.44) 
O[r]=0 forr<0 


enforces causality. Although this model is simplistic, it does satisfy the physical constraints 
of causality and analyticity. We find that the response vanishes at T < 0, its initial growth is 
governed by wo, and its decay is determined by y. If the applied field is oscillatory, then the 
response of the system will be greatest when the driving frequency is close to the system’s 
natural frequency and the damping is small enough that many earlier periods reinforce the 
response to the current period. 

More generally, if the system is composed of a single type of atom with Z electrons and 
we assume these atoms have a spectrum of normal modes of vibration with frequencies w ,, 
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Figure 6.5. Left: G[7 > 0] contour. Right: response for A = 0.2w, 


the dielectric permittivity can be parametrized in the form 


2 
e= 1+4rNÅ - fi To (6.45) 
m, 4 wi — 2iy¡w -w F 


where the parameters f, are described as oscillator strengths. The low-frequency response 
is determined by the limit w « w; where 


2 ; 2 
Ree x 1+4nN= Dala) (6.46) 
Me 7 wj w 


fY 
wt 


e) 

2 
Ime x 4IN— AT 
me ANO), (6.47) 


e j J 


Hence, if all normal modes have nonvanishing frequencies, the low-frequency response of 
dielectrics has the form 


Reelw] x 1+atbu* (6.48) 
Ime[w] x cw (6.49) 
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where a, b, c are constants. At the other extreme, the asymptotic response to large frequen- 
cies takes the form 


2 2iy, 
e Yj 
vel Dal - =) (6.50) 
such that 
w, \2 we 
Ree= 1- (22) ; Ime = 27-2 (6.51) 
w w 
where 
2 e e? 
wp = 4aN— y fi = 4nNZ— (6.52) 
e j e 
is the plasma frequency and 
A I 
12D (6.53) 
j 


is the mean oscillator strength. These asymptotic characteristics are expected to be more 
general than this semiclassical oscillator model. Therefore, the high-frequency response 
of real dielectrics is often characterized by the parameters 


w = lim (w? (1 — Ree[w))) (6.54) 
n [o 
y= lim (5 Im a] (6.55) 


The behavior of conductors is somewhat more complicated because the frequency of the 
lowest mode vanishes and will be considered in the exercises. 


6.2.3 Kramers—Kronig Relations 


The analyticity of €[w] in the upper half of the complex w plane allows one to derive useful 
relationships between the real and imaginary, or dispersive and absorptive, components of 
e on the physical (real) axis. Applying the Cauchy integral formula to e— 1, which vanishes 
for large w, we write 

elw] =1 + t ea du (6.56) 

2ni Jo w -w 

where C is any contour in the upper half of the w’-plane that encloses w. Experimental 
observations are made for real frequencies, of course, but the integrand appears to be sin- 
gular for real w. Formally we evaluate the integral for real frequencies by replacing w 
by w + iô, where 6 is a small positive number, and then taking the limit 6 > 0* from the 
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Figure 6.6. Contour in w’ used to evaluate Eq. (6.56) for w on the positive real axis. 


positive side. Assuming that an infinitesimal excursion into the lower half-plane encoun- 
ters no singularities, we can employ a contour that makes a very small detour around the 
singularity on the real axis, as sketched in Fig. 6.6. Assuming that the integrand falls suf- 
ficiently rapidly at large w’ so that the contribution of the semicircle vanishes, the contour 
integral can be separated into three contributions of the form 


-1 w-—ő 1—1 00 ¿| 
SI o imf PA do P A 
Cc u=0w $>0+ en w = (0 wg W-0 


2r 10 = 1 : 
+ f tas) (6.57) 
a e 


where the first two terms represent the contribution of the real axis and the third is the 
contribution of the small semicircle parametrized by w’ — w = de” = dw’ = ive" dé. 
Together the first two terms comprise the Cauchy principal value, denoted by 


00 =i w—Ó H= 00 "1-1 
zw W —-W 6307 =e U) =—W uri O =w 


while the third term has the value ¿x(e[w] — 1). Therefore, we obtain the net result 


dej=T4 > f “AP ay (6.59) 


w — w 


This method of evaluating contour integrals in the limit that a pole approaches the real 
axis, appears so frequently that a mnemonic formula 


1 
lim - + inô[w — w] (6.60) 


50' w -wi u—w 
is used to represent the limiting process. Here P is interpreted as an operator that converts 
an integral along the real axis to its principal value while the second term involving a 
Dirac delta function represents the contribution of the pole. Note that the contribution of a 
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pole on the contour is half the contribution it would have made if it had been interior to the 
contour. Another way to interpret this result is that the contribution of a pole on the contour 
is the average of its contributions if it is displaced infinitesimally inward or infinitesimally 
outward with respect to the region enclosed by the contour (a fence sitter that cannot quite 
decide which side to fall on). Incidentally, this result does not depend upon the details of 
the deformation of the contour — the same result is obtained using a semicircle around 
the other side or using a different parametrization of the detour, provided that its length 
vanishes in the limit 6 > 0 and that the coefficient of (w’ — w)~! is continuous at w. 

Separating e[w’] into its real and imaginary components, we obtain the pair of equa- 
tions 


1 [° Ime’ 
Re e[w] = 1+ Pf BAw gy (6.61) 
m Jo W -w 
1 [” Reew’]—1 
Im e[w] = --P ih eaen du! (6.62) 
x J. u-0 


relating the real and imaginary components of e[w] through dispersion integrals. Relation- 
ships of this general form can be derived for any function that is analytic in a half-plane 
and falls sufficiently rapidly far from the origin. However, functions representing physical 
quantities often have a more limited domain. Here, for example, measurements are made 
only for positive frequencies. Fortunately, symmetry properties permit extension from pos- 
itive w’ to the entire real axis. Recognizing that D[t] and E[t] are real functions, we require 


G[r] = f > dwe ™®™ (e[w] — 1) (6.63) 


00 


to be real also, such that 


GIT] = G[r] = e[-w] = € [w*] (6.64) 


On the real axis we find 
w* = w = Ree[-w] = Ree[w], Ime[—w] = — Im efo] (6.65) 


and with some straightforward manipulations we can express the dispersion relations in 
terms of positive frequencies only. Therefore, we finally obtain the Kramers—Kronig rela- 
tions in the form 


2 o0 a | yn 
Reew] = 1+ EP f 2 = du! (6.66) 
T 0 wW“ — WwW 
2 © Re ela] -1 
Imelo] = -£P { L du! (6.67) 
m 0 W -w 


These relations between the dispersive and absorptive properties of a dielectric medium 
are very general, depending only upon the very reliable assumptions of causality and iner- 
tial limitations of the polarization induced by extreme frequencies. Thus, if one measures 
the absorption spectrum the refractive index can be computed, or vice versa. Notice that 
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refraction must be accompanied by absorption — one cannot have one without the other 
without violating fundamental physics. In particle physics it is often easier to measure the 
energy dependence of the total cross-section, which is directly related to the imaginary 
part of the forward scattering amplitude, than it is to measure the real part of the scattering 
amplitude. Hence, dispersion integrals of similar form, based upon quite general princi- 
ples, provide important tools in nuclear and particle physics. 


6.2.4 Sum Rules 


It is useful to consider the asymptotic properties of e[w] in the limit w > oo, where the 
dispersion relations take the form 


2 Sa + ay? 
Reew] ~1-—; | w Imelo] 1+0((2) du’ (6.68) 
TO” Jo w 

2. 0 wy w’\4 

idea. | meats E (2) + (E) ) du! (6.69) 

nw Jo w w 
Assuming that the integral converges, the asymptotic behavior of the real part becomes 

w.\2 

Ree=1- (2) (6.70) 
w 


where the definition of the plasma frequency 
w, = lim (w*(1 — Ree[w))) (6.71) 


is consistent with the earlier definition motivated by the oscillator model. Therefore, we 
obtain a sum rule of the form 


2 [oe] 
w = zf wIme[w] dw (6.72) 
0 


The interpretation of the asymptotic behavior of the imaginary part is trickier. The present 
result suggests that its limiting form is proportional to w”!, but convergence of the sum 
tule for w, requires Im e[cw] to decrease asymptotically more rapidly than w7! which is 
consistent with the w~? asymptotic behavior found above for the oscillator model. In fact, 
using causality one can show more generally (see exercises) that for large frequencies 


E G” [0] 


Ime = F (6.73) 


and thus we expect to find 
f (Ree[w] — 1)dw = 0 (6.74) 
0 


and must carry one more term in the expansion of Ime. Therefore, the asymptotic behavior 
takes the form 
a? 
<P 
Ime = 275 (6.75) 
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where 
= 1 à 
Y= 5 (Re elw] — D)w* dw (6.76) 
TO, 0 


provides a second sum rule. 
If we assume that the asymptotic behavior of Re e applies when w > w,, such that 


w, \2 wy 
w> 0, = Ree= 1- (%2) sof) | (6.77) 
w W, 
we find 
os 0, ag? wt 
f (Re elw] — 1)dw = f Ree[w] dw — w, + o + ds) (6.78) 
0 0 c [+ 
Therefore, 


co Í we w. 2 w 4 
f (Re elw] - 1) dw = 0 => +f Reeloldo = 1 + (22) sof) | (6.79) 
0 W, Jo 0) w 


is known as a superconvergence relation. 

Sum rules play an important role in nuclear and particle physics. For some theories, 
such as QCD, calculations are often so intractable that it becomes difficult to test the theory 
by comparing energy-dependent predictions with experimental data. Sum rules, on the 
other hand, depend only upon very general assumptions, such as causality and symmetry 
properties. Therefore, provided that one can perform measurements over a large enough 
range of energy to ensure convergence of the experimental integral, sum rules test the 
underlying assumptions of a theory without need for detailed calculations of scattering 
processes. 


6.3 Hilbert Transform 


The Kramers—Kronig formula is an example of a Hilbert transform. Suppose that f[z] is 
analytic on the real axis and in the upper half-plane and that | f [z] converges uniformly to 
zero on the surrounding great semicircle. The Cauchy integral formula can then be used to 
write 


dé (6.80) 


00 TT ið 
fan Lf Mren e 


CS-Z Qni Jo sx Roo 27 Jo Re”? —x 


1 

flel = 2ni 
where C consists of the real axis plus the great semicircle and z is in the interior of C. 
The contribution of the great semicircle vanishes because | f [z] > 0. Thus, separating 
Flz] = Re f[z] + ¿Im f[z] into real and imaginary components, we obtain the pair of 
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equations 
Re f[z] = En e Im fis] 9, (6.81) 
2a Joo S-X 
mfz- [ “Aas (6.82) 


that relate the real component in the upper half-plane to an integral of the imaginary com- 
ponent on the real axis, or vice versa. 

Similarly, for a point on the real axis we deform C using an infinitesimal semicircular 
detour to write 


lf fis 


flix] = — ds 
278 Jo sx 
ey T l ee f flx + ee] d0 
2i Jo S- xX e0 27 Jr 
R Te AREA (6.83) 
+ lim — e” de 


R= 2n 0 Re” =% 


a Zf fil ds+ ifi 
2ni Jo 7 


s—X 


and again discard the great semicircle to obtain 


fl = ef AISI gs (6.84) 
Ti Jo S- X 


Note that the analysis is practically unchanged for functions that are analytic and decay 


as z > oo in the lower half-plane instead. Separating f[x] into real and imaginary compo- 
nents, we find 


Re f[x] = £ e Im fis] yy (6.85) 
T Jaca SHOX 
Im fx] = -7 T n ds (6.86) 


Notice that these formulas differ by a factor of 2 from those above because the contribution 
for a singularity on the contour is the average of contributions for nearby singularities on 
either side (half in and half out). If we now consider the real and imaginary components 
on the real axis to be two real functions of a real variable, we obtain the Hilbert transform 
pair 


n=? f O yes f ET (6.87) 
m Js S—X T Jog X-S 


One can assemble a table of Hilbert transforms by decomposing analytic functions that 
decay sufficiently rapidly as z > oo into real and imaginary components on the real axis. 
For example, Cos[x] and Sin[x] constitute a Hilbert transform pair because Exp[iz] is ana- 
lytic and satisfies the convergence requirements in the upper half-plane. However, the real 
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strength of this technique is to the analysis of experimental data where physical arguments, 
such as causality, guarantee the analyticity and convergence properties such that parame- 
trization of measurements for one component can be used to reconstruct the other. 


6.4 Spreading of a Wave Packet 


In quantum mechanics or optics one often constructs wave packets as a superposition of 
plane waves such that 


Wie, t] = f > w Exp[i(kx — wt)] (6.88) 


where W[k] is the amplitude of a plane wave with wave number k and where the frequency 
w = WwIk] is a function of k. The limitation to one spatial dimension is made here for 
simplicity only. If 


Wik] = FIk — ko] (6.89) 


exhibits a relatively narrow peak at k,, it becomes useful to express the Fourier integral in 
the form 


dk- 
yix, t] = Exp[ilkox — wof)] f = JIk — kol Exp[i((k — ko)x — (w — wo)t)| (6.90) 


where wọ = w[k,]. The contributions from k appreciably different from kọ will tend 
to interfere destructively because the phase of the complex exponential varies rapidly. 
Expanding the temporal frequency as 


(k = k Y 
w[k] x wo + wilk = ko) + 093 + (6.91) 
we can write 
; "dk. ; Wyk? 

Wx, t] ~ Exp[i(kyx — wot)] an Ml Exp ec — w,t)- | (6.92) 
where k = k — ky. Notice that if w, = 0, one obtains 

w, = 0 = yix, t] = fix- Vat] Expli(k,x — wot)] (6.93) 
where 

> dk ~ . % dl —iKx 

f= |] AdExplicd, Jas | dxe™ fx) (6.94) 

is the initial shape of the wave packet at time 1 = 0 and where v, = w, = W'[k,] is 


the group velocity at the peak of the momentum distribution. Under these conditions, the 
intensity 


Wx, JP = [fle — vet)? (6.95) 


propagates as a wave with velocity v, without changing shape. Therefore, if w[k] is linear, 
the propagation is described as nondispersive because the shape of the wave packet is 
preserved. However, if w, + 0, the wave packet will usually spread with time. 
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The phase and group velocities are related by 


d dy v k dv 
a vw = =v, =v, tk P > ls E 


€ dk dk v Vp dk (6-20) 


which for a narrow spectral distribution should be evaluated near the peak, kọ. From optics 
we describe situations where dv,/ dk < 0 = v, < v, as normal dispersion and dv,/ dk > 
0 = v, > v, as anomalous dispersion. 

Consider a Gaussian wave packet of the form 


2,2 
FIK] = (4ro?)% Exp E | (6.97) 
-1/4 ea 
fix] = (70?) “Exp -5| (6.98) 
normalized according to 
co 2 B co dk 2 B 
f dfi = f Sift = 1 (6.99) 


such that 
yix t] = (Aro?) 4 Exp [ ¿(kx - wot)| 


œ dk (o? + NS i 
f 5 Exp | ELE ee wn) (6.100) 


Completing the square, we obtain a wave packet of similar form 


2 1/4 . = = 2 
ie a=(%) Exp[i(kyx — wp1)] Exp|- (x - 0,1) | (6.101) 


T (0? + it)? 20? + iwt) 


except that the width 


o — 0? + iw,t (6.102) 


becomes time dependent and complex. Using 


1 l-i t 
T withr = 22 (6.103) 
GC: 


o? + iw,t ol +7?) 2 


the intensity profile takes the form 


_ 2 
wix, AP = (ra +? Exp e (6.104) 
or 
-1/2 
A ic A 
Wx. e] = (no (+(22)) Exp A) (6.105) 
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Figure 6.7. Spreading of a Gaussian wave packet. 


Therefore, we find that a Gaussian wave packet remains Gaussian in shape but that its 
width increases with time according to 


ro ol $ (es y (6.106) 


The spreading of the intensity profile for a typical wave packet is shown in Fig. 6.7. 

It is important to recognize that the width increases much more rapidly for a narrow 
than for a broad wave packet. A brief pulse contains a broad spectrum of frequencies, but 
high frequencies propagate more rapidly than low frequencies in a medium with normal 
dispersion (w, > 0). Thus, the high-frequency components race ahead of the lower fre- 
quencies and broaden the pulse. This effect can be seen by examining the real and imag- 
inary components separately, as shown in Figs. 6.8 and 6.9. Notice that the two figures 
have different spatial scales — the pulse in Fig. 6.8 is initially twice as wide as the pulse 
in Fig. 6.9, but after some time the latter is much wider than the former. It should also be 
obvious that the higher frequencies dominate the leading side while the lower frequencies 
dominate the trailing side of the pulse. We chose a ratio v,/v, = 0.5 between group and 
phase velocities that is typical of light in a plastic scintillator or voltage in a coaxial cable. 
Our choice of dispersive coefficient, w,, was somewhat arbitrary, but it is clear that narrow 
signals can be distorted quite rapidly by dispersive media. 

The relationship w = w[k] between frequency and wave number is often called a dis- 
persion relation because it governs the spreading of wave packets. A linear relation is 
nondispersive, but nonlinear components of the dispersion relation generally produce dis- 
persion of the wave packet because Fourier components with different frequencies prop- 
agate with different phase velocities. However, the shapes of nongaussian wave packets 
may change with time in more complicated ways. Furthermore, in nonlinear media there 
are sometimes special profiles, called solitons, for which the distribution of Fourier ampli- 
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Figure 6.8. Spreading of the real and imaginary contributions to the intensity of a relatively broad 
wave packet. 


Figure 6.9. Spreading of the real and imaginary contributions to the intensity of a relatively narrow 
wave packet. 


tudes f[k] is matched to w[x] such that the wave packet actually retains its initial shape as 
it propagates. 
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Figure 6.10. A potential that exhibits spontaneous symmetry breaking. 


6.5 Solitons 


Consider a classical field 6 = ¢[x, t] that satisfies a one-dimensional wave equation of the 
form 


Po Fo ov 
(ex — 


T == — 6.107 

Ot? Ox? 09 ( ) 
where o and 7 are inertial and stiffness constants and where 

Vid] = -4A¢? + Bd" (6.108) 


with positive constants A and B is a potential that exhibits a spontaneously broken sym- 
metry. Almost by inspection we recognize that V[@] has a local maximum at ¢ = 0 and 
symmetric minima at 6 = +VA/B with depth —A?/4B. Thus, it is useful to express V[@] 
in the form 


_A fA = oy (sÝ 
a aa) -(£)) 6109) 


which is plotted in Fig. 6.10. Although the potential is symmetric with respect to ¢, the 
equilibrium state with ¢ = 0 is unstable and is not the ground state. There are two stable 
ground states at ¢ = +0,, that do not share the symmetry of the potential — at low temper- 
ature the system will be found near either the positive or the negative state and this choice 


violates the symmetry of the equation of motion. The situation in which the ground state 
does not share the symmetry of the dynamical equations is described as a spontaneously 
broken symmetry. For example, the dynamics of a ferromagnet are rotationally symmetric, 
but in the ground state the constituent magnetic moments are aligned in some arbitrary 
direction that violates rotational invariance. 

In this section we demonstrate that, in addition to familiar wave solutions for small- 
amplitude oscillations about either of the minima, this highly dispersive nonlinear differen- 
tial equation also possesses propagating soliton solutions that induce transitions between 
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the stable states. We do not attempt to develop general methods for finding such solutions, 
which are beyond the scope of this course, but will be content to discuss the general prop- 
erties of stipulated solutions in order to gain some appreciation for the phenomena. Models 
of this type find widespread application in condensed matter and particle physics. 

It is useful to define the intrinsic wave speed in the absence of V as c = Vt/o and to 
divide out ¢,, to obtain a wave equation for y = ¢/¢@,, that takes the form 

19%y y 3 

y O A — y? 6.110 

ap e Ni (6.110) 
where 8 = 4V,/TØ2, = A/T. First we demonstrate that there exist approximate solutions 
of the form 


yix, t] = +1 + æ Sin[kx — wt + ô] (6.111) 


with œ < | that represent small-amplitude oscillations of y around one of the potential 
minima. Substituting the trial solution and expanding to lowest order in a, we find 


Cc 


2 
= (= = eje Sin[kx — wt + ô] = B(-2a Sin[kx — wt + 6] +--+) (6.112) 


Therefore, we obtain a highly nonlinear dispersion relation of the form 
w = (+28) (6.113) 


with phase velocity 


w 28 2A 
ARE RE (6.114) 


that is independent of the oscillation amplitude. Note, however, that because the equation 
is nonlinear the accuracy of the solution does depend upon amplitude — the smaller the 
amplitude the more accurate the solution. Also notice that the phase velocity for long 
wavelengths is increased dramatically by the harmonic part of the potential. The left side 
of Fig. 6.11 shows the phase velocity where kọ = y2A/T. 

Next, we demonstrate that there exist exact solutions of the form 


yix, t] = Yo Tanh[é(x + vt)] (6.115) 
for suitable choices of Wo, $, and v. Using 


Tanh[z] 


CoshlzP? (6.116) 


d 

— Tanh[z] = -2 

dz = 
direct substitution gives 


== B(Wy Tanh[E(x + vt)] — yà Tanh[£(x + ve) JP) 
(6.117) 


5) Tanh[€(x + vt)] 


2 — 
os ( Cosh[é(x + vt)? 


2 
C 
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Figure 6.11. Dispersion relations for Eq. (6.110). Left: small-amplitude waves. Right: soliton solu- 
tions. 
or 
y2 
= == ]- Fa Coshlets + vt) — Y Sinh[é(x + vo?) (6.118) 
c 


The space and time dependencies of the right-hand side can be eliminated by choos- 
ing Y, = +1, such that 


v=c m (6.119) 


where we choose the positive root because the trial solution already includes the propaga- 
tion direction. A soliton of this form propagates without changing shape, but the amplitude 
is not arbitrary and the speed depends upon the slope parameter €. Unlike small-amplitude 
waves which propagate with v > c, large-amplitude solitons propagate with velocity v < c. 
The dependence of the propagation velocity upon the slope parameter is a form of disper- 
sion, but unlike a familiar wave the amplitude of the soliton is not arbitrary because the 
equation is nonlinear. 

A snapshot of the soliton that propagates in the positive x-direction is sketched in 
Fig. 6.12. The soliton is a propagating disturbance that switches the local field from one 
stable state to the other as it passes and is often described as a kink. The distance over 
which the transition occurs is characterized by A = £7!, the inverse of the slope of the kink 
at the origin. Thus, we can express the phase velocity in the form 


(6.120) 


where Ay = V2Br/A? is a characteristic distance. The smaller the distance over which 
the transition occurs the stronger the kink and the greater its speed. Very strong kinks 
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Figure 6.12. A soliton solution to Eq. (6.110). 


with A > 0 approach the asymptotic speed c while weak kinks with A > A, propagate very 
slowly. In fact, the existence of this type of solution with real v requires 


à < Ay = L < 22 = é > VA/27 (6.121) 


The soliton velocity is also shown in Fig. 6.11. 

Solitons are commonly found in systems described by nonlinear equations; in fact, 
linear equations are often just an approximation for small amplitudes. However, if the 
compensation between dispersion and nonlinearity needed for solitons to propagate with- 
out changing shape were too delicate, it could be destroyed by external disturbances or by 
imperfections or impurities in real systems. In fact, stability is crucial to the observation 
of solitons, which otherwise would be little more than mathematical curiosities. Although 
it is beyond the scope of this course, one can often demonstrate explicitly that solitons are 
stable with respect to perturbations and propagate over long distances with little distortion 
even under nonideal circumstances. When analytical techniques fail, numerical simula- 
tions can be used to show that suitable initial profiles evolve into one or more solitons 
which then propagate stably. 


Problems for Chapter 6 217 


Problems for Chapter 6 


1. Analytic continuation of simple functions 

The following functions are defined by definite integrals which converge in domain D. 
For each, specify the original domain D, construct an analytic continuation into the largest 
possible region of the complex plane, and specify the analytic domain for the parent func- 
tions. 


a) fiz] = T e” Sin[t] dt (6.122) 
0 


b) s= er" dt forintegern > 0. (6.123) 
0 


2. Euler transformation 
Suppose that f[z] is represented as an alternating series of the form 


fil = )\(-"a,2" (6.124) 


n=0 


that converges for |z| < R. In this problem we explore a procedure for analytic continua- 
tion that uses differencing of coefficients to rearrange a power series in order to obtain a 
representation that converges more rapidly and in a larger domain. Additional variations 
of this method are omitted for the sake of brevity. 


a) Show that 
(1+ 2fldl =a) +z )\(-)"6a,2" (6.125) 
n=0 


where 6a,, = a, — 4,,,¡- Continuing this process, show that 


(1+ fle] = a +20, +2) "Paz (6.126) 
n=0 
where 
= Z m = m—1 
¿=> a, = 56""a,) (6.127) 


and obtain thereby a series expansion for g[¢] = (1+z)f [z]. Find an explicit expression 
for ô”a,. 

b) To demonstrate the usefulness of the Euler transformation, demonstrate that the Euler 
series for g[f] converges much more rapidly near ¢ ~ 4 than does the Taylor series 
for f[z] = Log[1 + z] near z ~ 1. In fact, show that g[Z] converges for Re[z] > -3 
independent of Im[z], which is a much larger region than for the original Taylor series. 
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3. High-frequency response 
The temporal response of a dielectric medium to an applied electric field was expressed in 
the form 


Dit] = E[t] + f i dtE[t — 7]G[7] (6.128) 


00 


G[t] = f > dwe *"(e[w] — 1) (6.129) 


00 


Express €[w] in terms of a Fourier transform and perform a Taylor series of G[7] applicable 
for tT > 0*. Note that this is a somewhat unusual series because G[t] = 0 for 7 < 0, so 
that derivatives at 7 = 0 are understood as limits for 7 > 0*. How is the high-frequency 
behavior of e[w] related to the time dependence of G? Compare your results with those of 
the semiclassical oscillator model. 


4. Absorption band 

Suppose that a material absorbs electromagnetic radiation only in the band w, < w < wy. 
For simplicity assume that Im € = 77 is constant within, and vanishes outside, the absorption 
band. Use the Kramers—Kronig model to evaluate Re e. Sketch this function and compare 
with the single-mode model. 


5. Subtracted dispersion relations 

If f[z] does not vanish as z => oo, or does not diminish rapidly enough to ensure good 
convergence of the dispersion integrals, one can apply the technique to the function z~! f[z] 
instead. More generally, dispersion relations for a function of the form g[z] = ( fla - 
f [xo])/ (z — xo) will converge more rapidly than for f[z] itself, especially if an optimum 
choice of x, is made. Derive dispersion relations for g[x], assuming that g[z] is analytic in 
the upper half-plane and vanishes on a great semicircle, using a contour with detours on 
the real axis around both x and xy. Then deduce the dispersion relations for f[x]. 


6. Hilbert transforms 
Given 


vix] = -7 T Ml gaa (6.130) 


Sy = 
s-x 1+x 


determine u[x] and construct the corresponding analytic function f[z] = u[x, y] + iv[x, y]. 
Then verify that f[z] satisfies the necessary requirements for (u, v} to constitute a Hilbert 
transform pair. Note that u[x] = u[x, 0] and v[x] = v[x, 0]. 


7. Dispersion of a Gaussian wave function 
Suppose that at time £ = 0 an electron is represented by a wave packet of the form 


2 kek 2 
so _ m (k= ko) | (6.131) 


1/4 
) z 


JIK] = (403) E 


where kọ and op are constants. Assume that the velocity is nonrelativistic, such that the 
kinetic energy is given by e = hw = (hk)?/2m. 
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a) Evaluate the wave function in space and show that it is normalized properly. 
b) Find expressions for the phase and group velocities. 


c) Evaluate the time dependencies for the mean position, x, and root-mean-square (rms) 
width, o, where 


X(t] = (Ya, ld lx, 1]) = f W' Lx, tly[x, tle dx (6.132) 
ole] = (wa, t]|@ - lye, tl) = f > y“ ix tly, t](x — ¥? dx (6.133) 


d) Suppose that an electron with 10 keV kinetic energy is initially confined to œ = 1 Å, 
about the size of an atom. How much time is required for appreciable spreading of the 
wave packet? How far does it travel in that time? How much time does it take for the 
wave function to become broader than a typical laboratory? 


8. Plane electromagnetic waves in conducting media 
One can show that electromagnetic waves satisfy a wave equation of the form 


2, 
(v a Ee es eee == (6.134) 
C 


when the permittivity €, permeability u, and conductivity o are constant. The electric field 
for a plane wave with frequency w is the real part of 


È [?, t] = Ey Exp li (k oe wt)| (6.135) 
a) Separate the wave number k = (n + iy)w/c into real and imaginary parts. Discuss the 


dependence of n and y upon frequency for both low and high conductivity. 


b) The electric and magnetic fields are related by 


ee 10B 
Vx E == --— 6.136 
ü c ôt ( ) 


Evaluate and discuss the dependencies of the relative magnitude and phase of the 
magnetic and electric fields for plane waves upon frequency. 


9. Signal transmission in coaxial cable 

A coaxial transmission line consists of two concentric cylindrical conductors separated by 
a dielectric. The voltage and current changes across a length Az of the inner conductor are 
described by 


AV ol 


ag TRIO Le (6.137) 
Al ov 

al L gye 1 
A (6.138) 


where R, L, and C are the resistance, inductance, and capacitance per unit length and G 
is the conductance per unit length for leakage current between the conductors. R, L, G 
usually vary relatively slowly with frequency. 
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a) Show that both the voltage and the current satisfy decoupled equations of the form 


1% 2 ð V 
(3 a 92 a DA =o (6.139) 


Determine the phase velocity for an ideal lossless cable with R > 0, G > 0. Under 
what conditions do signals propagate without dispersion? 


b) Using plane-wave solutions of the form V[z, t] œ Exp[i(kz — wt)], evaluate the dis- 
persion relation k[w] = x + iy and determine the frequency dependencies of the wave 
number, x, and attenuation coefficient, y. Also determine the phase velocity. Discuss 
the limiting cases of 


(i) an ideal lossless cable, 
Gi) high frequencies, and 
Gi) G0. 
c) Express the characteristic impedance Z = V/I for sinusoidal solutions in terms of the 


parameters (R, L, C, G) and the frequency w. Determine the phase difference between 
the voltage and the current for high frequencies. 


d) A logic pulse 
VIO, t] = VOL] - Volt - T] (6.140) 


is injected at z = 0 and propagates on the cable for z > 0. Produce a Fourier represen- 
tation for the wave form at later times; however, do not waste much time attempting 
to evaluate the integral, which is very difficult. 


10. Landau damping 
One can show that longitudinal density waves in plasma described by n[x, t] = no Exp| i(kx- 
wt)| satisfy a dispersion relation of the form 


2 
k > Of/dv 
(= =f IIO dy (6.141) 
p —00 


a 2 . . . . 
where w, = nye*/eym is the plasma frequency, ng is the average electron density, m is the 
electron mass, and 


2 
flv] = 272)! Exp a (=) | (6.142) 


is the Maxwell velocity distribution expressed in terms of the characteristic thermal veloc- 
ity vp = yKkg1 /m. Treat the wave number k as a real variable and w[k] as a complex-valued 
function of k to be obtained as a solution to this integral equation. Assume that w, > kvy. 


a) Notice that the integrand would have a pole on the integration path if w were real, but 
w may actually be complex. Alternatively, we can assume that the pole is practically 
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upon the real axis and use a semicircular detour to avoid it. Show that the dispersion 
relation takes the form 
2 
k '00 1 
(E) ap f vz int [2| (6.143) 


Wy v- w/ 


depending upon the choice of detour and provide a physical criterion for selection of 
the proper sign. 

b) Approximate the principal-value integral using the assumptions w ~ w, and w, > 
kvr. 

c) Find an approximate solution for w[k]. Which contour deformation is required? 


11. Faster than light? 
Suppose that a one-dimensional electromagnetic wave is described by 


yix, 1] = f l Hal Exp [i (to = a)| (6.144) 


where n[w] is the complex refractive index and that 


W[x>0,t <0] =0 (6.145) 
Thus, the spectral amplitude is given by 
Uw] = { dtw(0, t] Expliwt] (6.146) 
0 


where the frequency w is real. It is useful to extend 4[w] > ğ[z] into the complex plane. 
On physical grounds we expect n[w] => 1 for very large frequencies and we assume that 
n[z] is analytic in the upper half-plane with n[z] > 1 when |z| > oo. Evaluate the signal 
W[x, t] for x > ct. Many systems have frequency bands where the phase velocity exceeds 
light speed because n[w] < 1; how does that situation affect the result? 


12. KdV equation 
Water waves on the surface of a shallow channel satisfy the Korteweg—deVries equation 
ð 0 8 
E ee ee (6.147) 
Ot x 
where the variables have been scaled conveniently and where certain approximations that 


are needed for the derivation remain implicit. This equation nicely illustrates how a suit- 
able balance between dispersion and nonlinearity can produce a soliton. 


a) Suppose that the amplitude is sufficiently small to omit the nonlinear term, such that 


do =P 

Ot a == 0 (6.148) 
Show that there exist solutions of the form 

lx, t] = f blk] Expli(kx — wt)] dk (6.149) 


and deduce w[k] for this linear approximation. 
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b) Next, suppose that œ is small, such that 
— +n = (6.150) 
x 


Show that there exist traveling waves of the form n[x, t] = n[x — vt] where v depends 
upon the amplitude 7. Describe qualitatively the time development of 7 given that 
n[x, 0] initially has a bell shape. You will not be able to construct a familiar single- 
valued function, but this method does provide an algorithm suitable for numerical 
evaluation. 


c) Next, show that if we assume a traveling wave solution of the form W[x, t] = y[x— vt], 
the KdV equation can be integrated once. Evaluate the constant of integration when 
Wix, t] = 0 fort > +o. 


d) Finally, show that there exist solitons of the form 
yix, t] = A Sech[B(x — vr)? (6.151) 
for suitable choices of A and B. Provide a physical description of this solution. 


13. Sine-Gordon equation 
The sine—Gordon equation 


wlx, Owe, f 
i SBS = slot as 


arises in models of systems with periodic potentials. In this form all variables are dimen- 
sionless. 


a) Deduce the potential V[w] for which the inhomogeneous term is given by —0V/0y. 
Identify the stable and unstable equilibrium states of the field. Show that there exist 
position-independent solutions that describe small-amplitude oscillations about a sta- 
ble equilibrium state of the field. 


b) Construct approximate solutions that describe traveling waves of small-amplitude 
oscillations about the stable states and deduce the corresponding dispersion relation. 


c) Show that there exist soliton solutions of the form 


= VÍ 

Y, = 2n7 + 4ArcTan [exo[s =| (6.153) 
iio? 

Sketch and provide a physical interpretation of these solitons. (It is sufficient to verify 

the solutions by substitution and to use MATHEMATICA or equivalent software to per- 

form the algebra.) 
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Abstract. The Sturm—Liouville operator with suitable boundary conditions provides 
a self-adjoint system with real eigenvalues and a complete set of orthogonal eigen- 
functions. We explore the general properties of such systems and their role in con- 
structing Green functions for physical systems. Perturbative and variational methods 
are also introduced. 


7.1 Introduction: The General String Equation 


The displacement Y[x, t] of a finite, but not necessarily uniform, string can be described 
by a Lagrangian of the form 
b 
L= £Ldx (7.1) 


a 


where the Lagrangian density £ takes the form 


L=T-V (7.2) 
2 

T= Lota) = 5o[x]¥? (7.3) 
2 

y= eaf) + EY = ir + 3119? (1.4) 


Here o is the linear mass density, 7 is the tension, and x is the stiffness parameter for an 
additional linear restoring force that might be produced by the coupling of the string to 


another system. It is useful to define generalized velocities 

ov OV 

==, == 7.5 
‘Ot * Ox Ua 


based upon the generalized coordinate ¥. The equation of motion is determined by mini- 
mizing the action, whereby 


ty ty b OL. ðL ðL 
a f Ldt =0= | a f AE ot, + Oe] =0 (7.6) 


1 


where the endpoints in both space and time are held constant, as are coupling functions o, 
T, K. Substituting 


Oo¥ OoV 
ME Sea 
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(7.7) 
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h e (ðL ðL ðL 
dt | ad oy y y 
I ff (5p? * oy OF + ay? ) 


i b fOr OL að aL A5¥ 
=f a f aitt ae a +T | (7.8) 


and using the boundary conditions 
x=ab or t=t,,t, = d¥ =0 (7.9) 


to integrate by parts, we find 


h h b ƏL 00L ðL 
Ldt = d d = = Y == 71 
ô f t = 0 => I t Í d ae” a OE, a a 0 (7.10) 


1 
This equation must be satisfied for independent variations óY, which then requires that 
the coefficient of 6 vanishes anywhere in the spacetime interior. Thus, the continuum 
Euler-Lagrange equations take the form 


ð ðL 90L ðL 
ai ov, * dxa¥. ow” vere 


Therefore, using 


g 7 TTY (7.12) 
OL 
ay = ox], (7.13) 
OL 
ap = MX, (7.14) 


we finally obtain the general string equation 


ta) - olx] a — K[x]¥ = 0 (7.15) 


The normal modes of vibration can be analyzed by hypothesizing the sinusoidal time 


dependence 


Pix, t] = yix] Exp[-ior] (1.16) 


such that 


d d 
T (r | + wo [xlwlx] — «[x]y[x] == (7.17) 
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represents the spatial dependence. In addition, one must impose boundary conditions on 
the normal modes. For example, if the string is clamped at both ends one would require 


clamped => wW[a] == y[b] = 0 (7.18) 


while if the string were free at either end we would require the derivative to vanish there 
instead. In general, nontrivial solutions that are consistent simultaneously with two sepa- 
rate boundary conditions are possible only for particular characteristic values of the fre- 
quency. These characteristic frequencies, w,,, are called eigenvalues and the corresponding 
spatial functions, w,, are called eigenfunctions or normal modes. Thus, the nonuniform 
string is described by an eigenvalue problem of the form 


d d 
(rll ou) — Klxly,[x] + lol, [x] =0 (7.19) 


n? 


This equation reduces to the familiar case of a vibrating string in the special case of 
constant 7, constant o`, and vanishing x, such that 


a 
tx] 27, olxloo, «Kix -> 0 = y = ky, (1.20) 
x 
where 
2 T 
w, >k, C=— (1.21) 
Oo 


Finally, if a string of length / is clamped at both ends, the eigenvalues are simply k,, = nz// 
where n is an integer. The solutions 


nx 


2 
yi0] = yi = 0 = y, [x] = Sin 1 (7.22) 


form a complete orthonormal set with 


1 
S 9,1010, dx =o, 023) 
0 


and can be used to represent the spatial dependence of any piecewise continuous function 


co 


Í 
flx] = Sa, alx] = a, = f Y, [x]flx] dx (7.24) 


n=1 


as a Fourier sine series. These familiar and useful properties can be generalized to the 
nonuniform string, with suitable modifications to accommodate variations of tension or 
density. (Note: the term orthonormal means orthogonal and normalized.) 
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The general string equation is one example of a Sturm—Liouville eigenvalue problem. 
Let 
d 


d 
HE -qid => Dy = © (oie 


dix] 


D ariel 
dx 


=P )- aut (7.25) 
x 

represent a linear differential operator based upon the coefficient functions p[x] and g[x]. 

Based upon the mechanical problem discussed above, we could describe p[x] as a force 

density and g[x] as an inertial density. An eigenvalue equation for a system with weight or 

metric function w[x] that is analogous to the mass density then takes the form 


dip{x] 
dx 


d 
(D + Aw)W = 0 —> de (pix 


) sabiei (7.26) 


where one expects nontrivial solutions, w,,, for discrete eigenvalues, A > A, when suitable 
boundary conditions are applied for a finite interval. Many physics problems can be rep- 
resented in this manner. Therefore, in this chapter we shall study the general properties of 
one-dimensional Sturm—Liouville systems and the application of eigenvalue expansions to 
the solution of inhomogeneous equations of the general form 


(D + Aw)W[x] == fix] (7.27) 


The chapter on boundary-value problems will later apply these techniques to problems 
in two or three spatial dimensions using separable coordinate systems. After separation 
of variables one generally obtains Sturm—Liouville systems for each spatial dependence 
and these systems are connected by means of separation constants that depend upon the 
eigenvalues for the other equations. 
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In the broadest sense a Hilbert space is any infinite-dimensional linear vector space with an 
inner product. An abstract linear vector space S is a set of elements (vectors) with addition 
and multiplication operations that satisfy the following properties. 


1. closure under addition: Yf, g € Sh=f+geS 

2. commutativity of addition: f + g = g+ f 

3. associativity of addition: f+(g+h)=(f+g)+h 

4. existence of additive identity: 10 E€ Sə f +0 = f 

5. closure under multiplication by scalar: Vf € S,af € S 
6. commutativity of multiplication by scalar: (Bf) = (a6) f 
7. distributive law: a(f + g) = af + ag 


8. associative law: (a + B)f =af + Bf 
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9. existence of multiplicative identity: 11 € Sa 1f = f. 

10. existence of additive inverse: Vf eS dge Sa ft+g=0. 
The inner product (f | g) for a real Hilbert space must satisfy: 
1. F18 lf) 

2. (flgt+h)=(flart+ (fla) 

3. <f | f) = O with equality iff f = 0, 

while for a complex Hilbert space we require: 

L. F18 =l sy 

2. flath = Flat (flr) 

3. (f | f) = 0 with equality iff f = 0. 


A linear Hermitian operator £ acting upon the elements f, g € S is defined by the 
requirements 


g=Lf=>gES (7.28) 
Llaf + Bg) =aLlf + BLg (7.29) 
(f | Lg) =(g I LFY =(Lf12 (7.30) 


where a and £ are arbitrary constants. The complex conjugation properties are often com- 
bined in the statement 


PE (7.31) 


where £t is the Hermitian adjoint of £. For an ordinary vector space, £ is a square matrix 
and L" is the complex conjugate of its transpose; hence, a Hermitian matrix is identical to 
its Hermitian adjoint and is described as self-adjoint. The notation 


L =L= (Lflo=FlL)=FILIo (7.32) 


expresses the fact that a Hermitian £ can act to the right or its adjoint £* can act to the left 
without changing the value of the matrix element. 
Two elements of S are described as orthogonal when their inner product 


fig = (flg)=0 (7.33) 


vanishes. It is often useful to generalize the notion of orthogonality to include a metric 
function w, where in a conventional vector space w is a positive-definite diagonal matrix, 
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meaning a diagonal matrix whose elements are real, positive, and nonzero. A matrix of 
this type is clearly self-adjoint, such that 


(flwlg* =(glwlf) =w=w! (7.34) 


satisfies the complex conjugate property required for the inner product of a Hilbert space. 
Thus, two elements are described as orthogonal with respect to w when 


fLe = (flwlg)=0 (1.35) 


This notion of orthogonality with respect to a metric function will prove useful in more 
general Hilbert spaces also. 

For the purposes of this chapter we define a particular type of Hilbert space consisting 
of functions of a single real variable that are smooth and continuous on a finite interval 
a < x < b and satisfy boundary conditions of the form 


B,f = BF (7.36) 


where 


n ð J n ð j 
B, = Zela) , B,= Zela) (1.37) 


are linear differential operators evaluated at the endpoints of the domain. Hence, we require 
the elements f[x] € S to possess at least n continuous derivatives in the interior of the 
domain. We will refer to real Hilbert spaces with boundary conditions of this type as 
Sturm—Liouville spaces and complex Hilbert spaces of this type as Hermitian spaces. The 
simplest boundary conditions can be further classified as 


Dirichlet: fla] == f[b] == (7.38) 
Neumann: f’ [a] == f’[b] == 0 (7.39) 
periodic: f[a] == f[b], fla] == f'[b] (7.40) 


More generally, boundary conditions with Bf == 0 are described as homogeneous. 
Similarly, we define the inner product for a real function space as 


b 
ao] Flxlglx] dx = (g | f) (7.41) 


while for a complex function space we define 


b 
(f 18) al F[xlglx] dx = (8 1 fY (7.42) 


Notice that these definitions implicitly impose another constraint on S — the existence of 
an inner product requires the elements of S to be square integrable. Two elements of S are 
described as orthogonal with respect to the metric function or weight w[x] when 


fte =(flwlg=0 (7.43) 


where w[x] = 0 is real and nonnegative within the domain of S. In fact, we generally 
assume that w[x] is free of interior zeros. Usually the requirement of square integrability 
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is less demanding with than without the metric function. When using a nontrivial metric 
function, we often refer to the matrix element 


b 
(f |w |g) =f flxlglx]wlx] dx = (gl wl f) (7.44) 
for real or 
b 
(f |w |g) =H F"lxlelxJwlx] dx = (gl wl fy (7.45) 


for complex functions as the inner product, leaving the phrase “with respect to w” under- 
stood. For the general string equation where w[x] is the mass density o [x], we interpret the 
combination O [x] dx = dm as the mass of a differential element of string. The product of 
two functions f[x] and g[x] is naturally weighted by the mass dm carried by an infinitesi- 
mal length dx. 

A linear differential operator D of order n is defined by 


D- darla) (7.46) 
£ 


where each d; [x] is a fixed function of x that is independent of the element of S to which it 
is applied. We assume that the functions d [x] are finite and continuous within the domain 
of S. A differential operator is self-adjoint in a Sturm—Liouville space when 


b b 
| FO glx)) dx =f BOF dx => (f1D1g) =(g | DIP) (7.47) 


or in a Hermitian space when 


b b = 
f FLO gx) dx = (f s DIDID as) S(fIDIg=@IDIf (7.48) 


such that D can be shifted from one side to the other without changing the value of the 
matrix element. 


7.2.1 Schwartz Inequality 


The Schwartz inequality 


Kiwi? <lwl piglwig) (7.49) 


where equality pertains iff f = g should be familiar from linear algebra and applies to any 
Hilbert space, but we review the derivation to be sure. The theorem is obviously true for 
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the trivial case in which either vector is null, so we assume that neither is. Let h = af + bg 
where a, b are arbitrary scalars, such that 


h=af+bg=>(hlwlh)=4“aflwlf)+ab(flwlg+ab(glwlf) 
+bb(glwlg,=0 (7.50) 
If we choose 
a=(glwig)=(glwisXflwilfpt+oflwig)t+bglwif)t+bb=0 (7.51) 
and then choose 
b=-(flwlg = (glwisxflwif)—-<glwi fXfIwlg 20 (7.52) 


we obtain the Schwartz inequality. 


7.2.2 Gram-Schmidt Orthogonalization 


Suppose that we possess a set of m linearly independent but possibly nonorthogonal vec- 
tors {u p j = 1, m} and wish to construct orthogonal linear combinations 


OF = > a; ju; (7.53) 
j=l 
such that 
i+ j= (d|wl¢;) =0 (7.54) 


A simple procedure constructs the set {@;} one at a time, subtracting off the projections 
upon previous vectors. Choose 


ġ =u (7.55) 


and then assign 


(6, lwlw) 
$y = Uy (6, Iwo) Iwl 6)" (7.56) 
such that 
| w | 
(i | wl by) = day Jw Lu) = Tn Iw |i) =0 (7.57) 
Continuing in this manner 
(h; | w | uy) (b> | w | uz) 
= Uy — - 7.58 
52876, wo Owo) Mera 
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ensures 
E ICI (6, lw ly) 
(9, |w1 3) = (u |w | u3) rua) l A Glwie ES 
=0 
(7.59) 
= _ (u; |w | 3) _ ($ |w | u3) 
(Ø, |w I3) = (0, | w | u3) awp Avión A oe 
=0 
(7.60) 


This procedure can now be repeated enough times to produce m independent orthogonal 
vectors of the form 


ilw ai 
$=, Oy, = Uy OA (7.61) 


Usually we prefer an orthonormal basis, where the vectors are both mutually orthogonal 
and normalized, such that 


Y= ep, | w | 9) 0, (7.62) 
where 6; are real phases which may be chosen at our convenience. 


7.2.2.1 Example: Legendre Polynomials 


Suppose that we wish to construct a set of polynomials ¢,[x] on the interval (—1, 1) that 
are orthogonal with respect to the inner product 


1 
les f Fx] glx] dx (7.63) 


We can start with a nonorthogonal basis u,, = x" with n = 0 and apply the Gram-Schmidt 
orthogonalization procedure 


bo =1, (Gol bo) = 2 (7.64) 
| he ! 2 
psr- GU dy aang | rdis, vlgs f Par=5 (1.65) 
Do (Go | Uy) (0, | uy) 


by =x $ 


= 2S f eax fe 4 
CHED Glee E di—x | Pdr=e- 3 (7.66) 


to generate as many terms as our patience permits. Obviously, this is a job for a machine. 


Table [LegendreP [n, x], {n, 0, 5}] 


2 , 


i 1,3% 3x) 5 3 15x? 35x* 15x _ 35x? 63x” 
X, > = = 

2 2 2 2 8 4 8 8 4 8 
The Legendre polynomials are normally defined using the conventional normalization 


P.[1] =1. 
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7.3 Properties of Sturm—Liouville Systems 
7.3.1 Self-Adjointness 


An important example of a self-adjoint operator is the Sturm—Liouville operator 


d d 
£= T (ora - qx] = Lf] = ph’ be) + p 1 Lx] — aed fb) (7.67) 
X xX 


where p[x] and q[x] are real functions and where we can assume, without loss of generality, 
that p[x] is nonnegative. Suppose that f[x] and g[x] are two piecewise smooth functions 
that both satisfy specific boundary conditions at x = a, b but are otherwise arbitrary. Such 
an operator is self-adjoint whenever these boundary conditions ensure that 


b b * d b d *\b 
f | f ets £fts do] = (r) = (sp f ) (7.68) 


(This result is obtained by integrating both sides by parts.) Thus, we require boundary 
conditions which will ensure 


PLb\ Fb)" g [b] — sib] f TbI®) = plal(flal"g [a] — gla] flal) (7.69) 


for any f, g e S. Boundary conditions which ensure that a differential operator is self- 
adjoint are sometimes described as self-adjoint boundary conditions. 

First, consider a regular Sturm—Liouville system for which p[x] is strictly positive 
throughout the interval, having no interior zeros and being nonzero at both endpoints. 
Perhaps the simplest boundary conditions with this property are the Dirichlet conditions 
fla] == f[b] == 0. Also simple are the Neumann conditions f'[a] == f’[b] == 0. More 
generally, one can show that any linear conditions of the form 


Bf = afla] + 0, f la] == (7.70) 
B,f = Boflbl + B,f [bl = (7.71) 


where a; and £, are fixed real constants (independent of f), will suffice to ensure that 
£ is self-adjoint. Such conditions are described as unmixed because each equation only 
involves values at one endpoint and as homogeneous because B, f == 8,f == 0. A bound- 
ary condition that involves both a value and a derivative at the same point is described as 
intermediate (between Dirichlet and Neumann). When p[a] = p[b], the periodic boundary 
conditions 


fla] == fib], Fla] = f'[b] (7.72) 


will also produce a self-adjoint system. Periodic boundary conditions compare values at 
the two endpoints and may be described as mixed. Sometimes it is possible to employ more 
complicated mixed boundary conditions, but we will not. 

Next, suppose that p[x] vanishes at one or both of the endpoints. At first glance it might 
appear that one need not impose boundary conditions upon the elements of S, but since 
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£ usually arises in connection with a differential equation of the form 
Lf lx] + Awl] fled = (7.73) 


we must recognize that a zero of p[x] is a singular point of the differential equation and 
that conditions, such as finiteness, will be needed to handle the singularities. If p[x] were 
to display interior roots, we would have to have to apply Sturm—Liouville methods to the 
intervening regions separately and hope that some method could be devised to bridge the 
singular points. However, space does not permit a more general exploration of irregular 
Sturm—Liouville problems. 

Similar methods can often be used to demonstrate that an operator is self-adjoint on an 
infinite domain, either 0 < x < œ or —co < x < oo. Under these conditions one usually 
needs p > 0 as x > too and the appropriate boundary conditions require the solution to 
remain finite in order to ensure that the integrated terms vanish and that the operator is 
self-adjoint. Similarly, one can extend many of the properties developed here to operators 
that satisfy 


b b “ 
f FHOglr) dv = (f eDay) (7.74) 


in two or more dimensions. The integrated terms encountered in the demonstration that the 
inner product is self-adjoint with respect to suitable boundary conditions are then described 
as surface terms. If the surface terms must vanish for any pair of functions (f, g) that satisfy 
the boundary conditions, then D is self-adjoint with respect to those boundary conditions. 

The Sturm—Liouville operator is just one example, albeit an important one, of a self- 
adjoint operator. A Sturm—Liouville system is defined by its operator, £, or equivalently 
its coefficient functions p[x] and q[x], its boundary conditions 8, and 8,, and its weight 
function w[x]. Many of the important special functions in physics originate in Sturm- 
Liouville systems. 


7.3.2 Reality of Eigenvalues and Orthogonality of Eigenfunctions 


An eigenvalue problem is defined by 
Du;[x] + A;w[x]u,;[x] == 0 (7.75) 


where D is a self-adjoint operator and u, € S satisfies self-adjoint boundary conditions 
at the endpoints of (a, b). Generally one can satisfy both boundary conditions simultane- 
ously only for very particular values of A,, called characteristic values or eigenvalues. The 
corresponding solution u, is then called a characteristic function or a normal mode or an 
eigenfunction. Consider two eigenfunctions satisfying 


Du,[x] + A;w[x]u,[x] == 0 => u[x]"Du,[x] == A wLx]u;[x]*u,Lx] (7.76) 
Du [x] + A w[x]u [Lx] =0= ula Du [x] == ~A wix]u; lx] u Lx] (7.77) 
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such that 


b b * 
f u lx] Du;[x] dx -— (f u;lx] Du lx] as] 


b b * 
=- f ubTuweddr (a, f uxu ots ds) 


b 
= (Aj — w f u [x] u;lx]wix] dx (7.18) 


where the left-hand side vanishes for any self-adjoint D and where we use the fact that the 
weight w is real. Thus, recognizing that the normalization integral 


b 
(lwl) = f u;[x]*"u;[x]w[x] dx = 0 (7.79) 


a 


is positive-definite for any nontrivial (i.e., nonzero) eigenfunction, we find that the eigen- 
values must be real, such that 


i= j= àù =ù =} ER (7.80) 


Furthermore, if A; + A; the integral on the right-hand side must vanish, such that 


b 
À; + A; => u; lwlu,)=0= 1 u [x] u; [x]wlx] dx (7.81) 


Therefore, eigenfunctions with different eigenvalues are orthogonal with respect to the 
weight function. On the other hand, if there are m linearly independent eigenfunctions 
with the same eigenvalue, which is described as m-fold degenerate, it is possible to form m 
independent linear combinations of the form 


m 


g] = >, a, ju [x] (7.82) 


j=1 
that are mutually orthogonal (with respect to w), such that 
i+ j= (lwl) =0 (1.83) 


using the Gram-Schmidt orthogonalization procedure. The eigenvalues can then be obtain- 
ed from 
_ G1D19) 
‘ (6; |w 14) 
where it is not necessary to normalize the eigenfunctions. On the other hand, it is always 
possible and usually more convenient to construct an orthonormal basis satisfying 


(7.84) 


De lx] + Awe] = 9, (yg, | wl ~;) = Ôj (7.85) 


Note that it is sometimes convenient to distinguish between the orthonormal set (p,) and 
more primitive sets {@;} before normalization and (u,) before orthogonalization within 
degenerate subspaces. 
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7.3.3 Discreteness of Eigenvalues 


One can also show that the set of eigenvalues {A,} is infinite, bounded from below, and 
has at most one accumulation point at co. Thus, one can arrange the eigenvalues in an 
increasing sequence A, < A, < A -++ where we omit duplicated values. Often some of the 
eigenvalues will be duplicated several times; a value that is repeated m times is described 
as m-fold degenerate and then corresponds to m linearly independent eigenfunctions. We 
normally assume that the Gram-Schmidt orthogonalization procedure has been used to 
construct m linearly independent, mutually orthogonal eigenvectors for each degenerate 
eigenvalue. Degeneracies often result from a symmetry, such as reflection or rotational 
symmetry. 


7.3.4 Completeness of Eigenfunctions 


Suppose that [x] is an arbitrary piecewise smooth function in (a, b) with a finite number 
of discontinuities and that it satisfies the boundary conditions imposed upon a self-adjoint 
operator D. Let {y,,} represent the orthonormal eigenfunctions of L, such that 


Do, +A we, = 9 (7.86) 
b 
(Py IWI Pn) = f PaT Pnlxlwi] dx = ô, m (7.87) 
where w[x] is the weight function for this system. We can then expand 
lx] => a,¢, [1 (7.88) 
b 
a, = (P, lw |Y) = f P lx] ylx]wlx] dx (7.89) 


to any desired degree of accuracy. More rigorously, we define the mean-square error in a 
truncated expansion with N terms as 


b 
T) 


One can show that the mean-square error approaches zero as the number of terms increases, 
such that 


N 2 
yil- Ya, B| wddx (1.90) 


n=1 


lim Ay4 = 0 (7.91) 


This condition is described as convergence in the mean and is less restrictive than uniform 
convergence, but it suffices for practically any application in physics. Indeed, we cannot 
expect uniform convergence if y has a discontinuity; the error in the eigenfunction expan- 
sion at a discontinuity is sometimes called the Gibb’s phenomenon. Note that uniform 
convergence can be established for analytic functions w by comparison with a Laurent 
series and expansion of p, in power series. 
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Next consider the eigenfunction expansion of a delta function 


ólx-¿l =>) 4,9, 1x] (7.92) 

a, = f " glx] ôl - Ewi] dx = wE, IET (7.93) 
whereby 

dix- él = )) p, wE, [T° (7.94) 


Given that the weight function w[x] is positive-definite within the relevant interval, this is 
sometimes written in the more symmetric form 


six- £l = Dg, bday) whdwtely, [él (7.95) 


This somewhat peculiar expansion represents the completeness of the eigenfunction expan- 
sion. If we take a sufficiently large number of terms, the oscillations of y,[x] and ¢,,[€] 
delicately conspire, with the aid of the density function w[x], to assemble a good approx- 
imation to a delta function that vanishes everywhere except at the single point x = é but 
still has unit area. Thus, any function 


b 
n= | area: (7.96) 
represented as a convolution over a delta function can be expanded as 
b 
f= 1 Y galewlély, lE fE dE = Y apt] (1.97) 
where 
b 
a= (WIP = | oE EEIE (1.98) 


Notice that these properties apply to any self-adjoint operator — nowhere did we use the 
specific Sturm—Liouville form. Therefore, any self-adjoint operator D has real eigenvalues 
and a complete orthonormal set of eigenfunctions. 


7.3.4.1 Example: Fourier Series 


The Fourier series is represented by 


2 


d 
L= qe? w[x]=1, [0] = [L] (7.99) 
x 
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such that 
-1/2 : 2n7 
(L+A,w)y, = 0 = (,[x] = L! Explik,x], àp =K, k,= I (7.100) 


where n is an integer. The orthonormality and completeness relations take the form 


L 
(Pn l Pm? = { Expli(k,, ~ kx] dx = nee (7.101) 
0 
1 co 
dbx - £1 = > > Explik, (x — €)] (7.102) 


Thus, an arbitrary piecewise smooth function f[x] satisfying periodic boundary conditions, 
F[O] == F[£], can be represented by the discrete Fourier series 


fix] = fe Dit Explik,,x] (7.103) 
1 b 
a,= 4) 7 f Exp[-ik, x] f[x] dx (7.104) 


7.3.5  Parseval’s Theorem 


Suppose that 
yix => arena] (7.105) 
" b 
ay = (eq |wi) = | elvis (7.106) 


is expanded in a complete orthonormal set of eigenfunctions. The normalization of y is 
given by 


WwilwlW)= ie yix] y[x]w[x] dx = f 2 aP BT p, edw] dx (7.107) 
such that 
Wiwit => aP = yiwe Xp bw lw (7.108) 


represents a generalization of Parseval’s theorem for Fourier series to general Hermitian 
systems. 
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7.3.6 Reality of Eigenfunctions 


Although the coefficient functions for the Sturm—Liouville operator are real, one should 
not assume that the eigenfunctions are necessarily real. For example, the complex func- 
tions 


2 
p,[x] = 17? Exp[ik,x], k, = = (7.109) 
are eigenfunctions of 
(5 + Je, =0, 10] = ọ[L] (7.110) 


even though the eigenvalue equation is real. On the other hand, the conjugate functions 
(, [x]* are also solutions to the same equation 


Bo, 
(5 + An =0, e[0]* = p[L]" (7.111) 


with the same eigenvalues. The linearity of the equation permits one to form the real com- 
binations 


u, lx] = pilte bT = L"? Cos[k,x], 
ior (7.112) 
y. [q = Pat Pall _ 4 Sinfk x] 
n i n 
that also satisfy the same equations 
d 
E + eae =0, u,[0] = u,[L] (7.113) 
dx 
a 
& + jota ==0, v,[0] =v,[£] (7.114) 


with the same eigenvalues. Therefore, we may construct real orthogonal eigenfunctions for 
any self-adjoint differential operator with real coefficients. This property is often helpful 
in analyzing Sturm—Liouville systems. 


7.3.7 Interleaving of Zeros 


Consider two real eigenfunctions satisfying 
Ly, Lx] + wlxIw, lx] = 0 = 
vata (pix Lx) — ghey, bx) == A whey ll, lx] 
Lynx] + AwlxIw,[x] = 0 = 
Y, val (plx14,1x) - all, Lx) = -wiy [xv lx] 


(7.115) 


(7.116) 
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where A, + A,. Subtract these equations and integrate from a to x, such that 


dy, [t] a) - 


(up Y, l1]plr] (A, =À of wo lely, [1 we] dt (7.117) 


Self-adjoint boundary conditions ensure that the lower limit of integration does not con- 
tribute to the left-hand side, such that 


dy [x] me [x] 


Y, [x] 


rea =) = (A, - uf Yltyy [e]wle] dt (7.118) 


Suppose that we choose € to be a zero of w,, such that 


E 
PIEI] = (A, - uf yltyi [e]wlr] de (7.119) 


Both p and w are positive in (a, b). Further suppose that £ is the root of y, which is nearest 
to the lower endpoint a and that we choose the sign of y, to be positive for a < x < £, 
such that Wi[€] < O. The sign of the left-hand side is then opposite to the sign of plé]. If 
A, > A, there must be a sign change in y,[x] for a < x < é in order to obtain a negative 
value for the integral on the right-hand side. Therefore, larger eigenvalues produce stronger 
oscillations and a smaller spacing between roots of the corresponding eigenfunctions. 

This argument can be extended by integrating in the subinterval (x, x,) between two 
consecutive roots of y,, such that 


[pane -un R = (a) f valentine 000 
Integrating the original eigenvalue Ei over a subinterval gives 

Lola) + doled] = 0 = (pt ne) i =f (wl) = gle Dutt) at (7.121) 
It is useful to define 

gala] = Aw] — glx] (7.122) 
such that 

(» ey - = i i g lelyit] dt (7.123) 


Suppose that x, and x, are two consecutive roots of y. If g,[x] > 0 in (x,, x,), the slope 
increases in the interval such that y[x] has an approximately exponential behavior. On 
the other hand, if g,[x] < 0 in (x,,x,), the slope decreases, and W[x] has an oscillatory 
behavior. 
1d? 
Aw[x] > qlx] => (1) >0 (7.124) 


Py 
Awi] < q] = co T? >) <0 (7.125) 
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7.3.7.1 Example: Fourier Series 
The differential operator 


d 
de’ 


L F(0] = fir] = (7.126) 


is self-adjoint and the eigenvalue equation has solutions 
(L+A,)u,[x] = 0 = u, = Sin[nx], A, = n? (7.127) 
which satisfy the orthonormality condition 
(u, | Un) = f Sin[nx] Sin[mx] dx = son (7.128) 
0 


The roots are obviously interleaved. 


7.3.8 Comparison Theorems 


Suppose that u[x] and v[x] satisfy 
d d 
(Fpl + eb jul = 0 (7.129) 
d d 
(Spt +gb = 0 (7.130) 


where p[x] > 0 and g,[x] < g,[x] in the interval a < x < b. One can then show that there 
is at least one root of v[x] between any two roots of u[x] in this interval, such that v[x] 
oscillates more rapidly than u[x]. To prove this result, known as Sturm’s first comparison 
theorem, we argue by contradiction. First observe that 


diy ; 
geU v = uv’) = uv(g, — 81) (7.131) 
Next, suppose that x, , are two successive zeros of u[x] such that a < x, < x, < b. 
Integration of the preceding equation between the roots of u gives 
(plu'v - ww), = | uv(g, — g,) dx (7.132) 


* 


or 


PIX, lw [x vix] — plx Ju’ Lx, vix] = f uv(g, — g,) dx (7.133) 


If v does not have a root within this interval, we may assume without loss of generality 
that it is positive throughout the interval. Similarly, we may also assume that u is positive 
between its roots because the differential equations are linear and we are free to multiply 
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solutions by a constant that will achieve the desired sign. Thus, the right-hand side is 
nonnegative because g, = g, in this interval and vanishes if and only if g, = g, over the 
entire interval. However, the assumption that u is positive between roots at x, , requires 
that u’[x,] < O and u'[x,] > 0, such that the left-hand side is negative. Therefore, the 
assumption that v[x] does not change sign in this interval leads to a contradiction and we 
can conclude that v[x] must have at least one root in the interval x, < x < x. More 
generally, Picone’s modification states that if 


d d 
(frst + eLo ula == (7.134) 


d d 
(pole +s) =0 (7.135) 


where 0 < p, < p, and g} < g, ina < x < b, then Sturm’s first comparison theorem still 
applies. 

Recognizing that the Sturm—Liouville eigenvalue equation can be expressed in the 
form 


d d 
(¿rbd + stxl pid =0 (7.136) 


where g[x] = Aw[x] — q[x] with w[x] > 0, we immediately see that increasing A makes the 
solution y[x] more oscillatory. Suppose that y[x] satisfies 


d d 
(Spl E +w -alpi =0, ya] = 0 (7.137) 
and that u[x] satisfies 


d d 
(= Pas i ÀW min ~ x =0, ula] =0 (7.138) 

dx dx 
where the weight functions are replaced by their maximum or minimum values within the 
interval a < x < b, as indicated. The solutions for u are simply sine functions with period 


T =27 | —Pmax (7.139) 


max 
ÀW min 7 Imax 


and, according to Picone’s modification, there is at least one root of y between successive 
roots of u. However, because the period T ax decreases as the eigenvalue A increases, it is 
clear that the spacing between roots of y decreases as A > oo. The maximum separation 
between zeros of y is T,,,/2. By applying a similar analysis to 


d d 
(= Pain + ÀW max a dmn víx]=0,  v[a] = (7.140) 
dx dx 


we find that the minimum separation between zeros of y is given by T in/2 where 


Pmin 


max — fmin 


Trin = 2 
min T Aw 


(7.141) 
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Therefore, if x, is the n-th zero of y it must be found in the interval 


—_Pmin — cx -a<n | __ (1.142) 
ÀW max — Amin ÀW min — Umax 


The interval we found for x, can be used to deduce bounds upon the corresponding 
eigenvalue. Suppose that the boundary conditions require y[a] == y[b] == 0 and let x, > b 
and 1 > A, > 0. Simple manipulations then provide the bracketing conditions 


1 2 1 2 
(E) Pain + dmi <A <— (E) > +q (1.143) 
b-a mın mın n b-a max max 


W max W min 


Although the numerical factors may be different, similar brackets can be deduced for more 
general boundary conditions and often provide useful bounds when exact eigenvalues are 
difficult to obtain or when very precise results are not needed. Such bounds can also pro- 
vide limits and starting conditions for numerical methods of computing eigenvalues. Fur- 
thermore, for large n one finds 


2 2 
n = oo = Prin (E) <a, < Pass ( =| (7.144) 
b=a w b-a 


W max min 
and concludes that A, scales with n? for large n. This scaling is a general property of 
Sturm—Liouville systems that is independent of the specific boundary conditions. Notice 
that the spacing between eigenvalues cannot be made infinitesimal unless the interval is 
infinite. Thus, the spectrum of eigenvalues might be continuous for an infinite interval, 
but is discrete for a finite interval. Nevertheless, the eigenvalues increase without limit, 
eventually scaling with n?. The lowest eigenvalue has the smallest number of nodes within 
a < x < b that is consistent with the boundary conditions and increasing n increases the 
number of nodes. Therefore, one can index the eigenfunctions according to the number 
of nodes. When several eigenvalues are degenerate, a second index might be required to 


distinguish between orthogonal eigenfunctions with the same number of nodes. 


7.4 Green Functions 


We have already seen that Green functions provide a powerful method for solving linear 
inhomogeneous equations in which the output of a system can be expressed as a con- 
volution of the input with its response to a point source. Many physical systems can be 
represented in terms of self-adjoint operators. Therefore, it will often be useful to develop 
representations of the Green function in terms of the eigenfunctions for self-adjoint sys- 
tems. In this section we develop a couple of these methods. 


7.4.1 Interface Matching 
Suppose that 


d d 
j= + (ot =] - glx] (7.145) 
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with suitable boundary conditions is a Sturm—Liouville operator that is self-adjoint with 
respect to the density function w[x] on the interval a < x < b. We seek a Green func- 
tion G,[x, £] that satisfies 


(L + Aw[xDG, Lx £] = ôlx - 6] = 


d d (7.146) 
(E(P) ab + awts] Gls El = ole - 41 

dx dx 
with the same boundary conditions. The subscript acknowledges the parametric depen- 
dence of the Green function upon A. We expect G,[x, €] to be continuous in value, but the 
delta function produces a discontinuity in slope. The magnitude of this discontinuity is 
determined by integrating the differential equation across the interface 


br q d 

{ (= (ria) — qlx] + dla] }Obe éļdx = 1 (7.147) 
such that 

lim PIEGA IE + e, £] - Gilg - 8, €) = 1 (7.148) 


where p, q, and w are continuous. These conditions can be expressed more compactly as 


1 
AGrlxé6]==0, AGilx é] = ea] (7.149) 


where A represents the change across the interface. 
Further, suppose that we possess two independent solutions to the simpler homoge- 
neous problems 


(L + Away, [x] = 0, By, [x] = 0 (7.150) 
(L+ Away [x] =0, B,y,[x] = 0 (7.151) 


with a common parameter A such that the Wronskian 
W, [x] = yay ed] — y, Ply Lx] (7.152) 


is nonzero throughout the region a < x < b. Notice that these solutions do not need to 
satisfy both boundary conditions simultaneously; y, satisfies the lower and y, the upper 
boundary condition. The equation for the Green function is also homogeneous in the 
two separate regions x < € and x > &. Thus, we seek a Green function with piecewise- 
continuous representation 


x < E = Gl €] = AlEly, [x] (7.153) 
x> é => G ix él = BlEly, [x] (1.154) 
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that automatically satisfies both boundary conditions. The coefficients are determined by 
applying the matching conditions 


AG, |x, €] = 0 = Blély, [$] - Alély,[é] = 0 (7.155) 
1 1 

AG! == — [€] — A[€ly [€] == —- 7.1 

GyLx $] nel => BIE [é] — Alélyalé rel (7.156) 
at the interface, such that 
y, 15] YalS] 

Alé] = —R———, Blél = == 7.157 
ana 78! ama id 


Therefore, we finally obtain the Green function in the form 


Yale ly, [ry] 
PIEIW\ 18] 


where x_ is the smaller and x, is the larger of x and €. One can show that the product 
PIE]W,[€] is actually independent of € for Sturm—Liouville systems; that exercise is left to 
the reader. It is then clear that a Sturm—Liouville Green function satisfies the reciprocity 
condition 


G,[x, €] = (7.158) 


G lix £] = G Ié, x] (1.159) 


showing that the effect of a point source at € on the response at x is the same as the effect 
of a point source at x on the response at €. 
The solution to a more general inhomogeneous problem of the form 


(£ + Aw[x])W[x] = wlx] fl] (7.160) 


with the same boundary conditions can now be expressed in the convolution form 


b 
v= f Gylx, El fle wie] dé (7.161) 


where it is not necessary to include a solution to the homogeneous equation because the 
Green function incorporates the boundary conditions automatically in its very construc- 
tion. If we divide the convolution into lower and upper regions, the convolution integral 
takes the form 


x b 
Vox] = (PRIME) [oss S veteirtentel de yd S IEEE ae) 
(7.162) 
where pW, can be extracted because it is constant. 


This method fails if W, [£] = 0. A Wronskian vanishes when two solutions are not lin- 
early independent. Suppose that y, and y, satisfy the boundary conditions at both endpoints 
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simultaneously. Under those circumstances, A = A, actually represents one the eigenvalues 
of the Sturm—Liouville problem represented by 


(L+A,wixDe, [x] =0, Bp,[x] == 0 (7.163) 


and the Green function G, does not exist. We will return to this problem again later. 
Similarly, the method also fails if plx] = O within the interval. The roots of p[x] are 
singular points for the underlying differential equation. Hence, Sturm—Liouville methods 
are generally limited to the intervals between singular points and it can be difficult to 
connect adjacent intervals. Although special care may also be needed if p[x] vanishes at 
one or both of the endpoints, we will assume that p[x] is nonzero in the interior. 


7.4.1.1 Example: Vibrating String 


The Green function for a vibrating string clamped at both ends satisfies 


d 
(5 + e)a é)=6[x-€], G[0,Él == G[1,£] = 0 (7.164) 


Rather than apply the general formula, let us apply the method of interface matching 
directly. On the two sides of the interface we write 


x< € => G; lx, €] = aSin[kx] (7.165) 
x > ëE = Glix €] = bSin[k(x - D] (7.166) 


and identify the discontinuity in first derivative as 


lim(Gilé + e, £] — Gil - e, £1) = (7.167) 
to form the equations 

0 == bSin[k(é — /)] — a Sin[ké] (7.168) 

1 == k(bCos[k(é — 1)] — a Cos[ké]) (7.169) 


Thus, we obtain 
Sin[kx] Sin[k(é — ))] 


= 7.17 

eee Gnd] ical (7.170) 
Sin[k€] Sin[k(x — /)] 

= 7.171 

x>& => Gylx £] KSin[k/] ( ) 
These expressions can be combined into the more compact form 
in[k. in[k(x, —/ 

aisge Sin[kx_] Sin[k(x, — D)] (7.172) 


k Sin[k/] 


where x_ is the smaller and x, is the larger of x and €. The solution to the more general 
inhomogeneous problem 


d 
(E + ejes = fix], y[0] =y[1] = (7.173) 
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now takes the form 


1 X 1 
wx] = sare Dm] f Sin[ké] fE] dE + Sin[kx] f Sinfk(é — ISIE] ae) 
in[X/] 0 x 
(7.174) 
Notice that the Green function does not exist when 
k> E => Sin[k/] = 0 (7.175) 


for integer n. Thus, the Green function does not exist when k = k,, is one of the eigenvalues 
of the homogeneous problem. 
The present solution does, of course, agree with the general formula. Identifying 


p=l, q=0, w=0, 1=P£ (7.176) 
and 
y, = Sin[kx] 
y, = Sin[k(x —)] => W,[x] = K(Sin[kx] Cos[k(x — 1)] — Sin[k(x — 1)] Cos[kx]) 
= k Sin[k/] 
(7.177) 


we would have written 


cris y, Lx. ly, [x5] E A EA =D] (7.178) 
pilm Ié] k Sin[k/] 
without being the wiser for it. Once the method is familiar, one can usually apply it rather 
quickly and in so doing may notice important special features of a particular problem — 
it is usually better to apply the method to each problem rather than simply plug into the 
formula. 

It is instructive to examine the evolution of G, as k increases. Figures 7.1-7.4 display 
G¿[x, €] for several values of k midway between adjacent eigenvalues. The cusp along the 
x == é diagonal is responsible for the characteristic LG == 6[x — €] behavior that defines a 
Green’s function and is clearly evident for small k. In fact, in that limit we obtain 


x, -1 

k > 0 = Gilx, El > ee (7.179) 
However, as k increases, oscillations off the main diagonal become important also. 
7.4.2 Eigenfunction Expansion of Green Function 
Suppose that 

d d 
== Ka ae 7.1 
£= £ (r£) -ab (7.180) 


with suitable boundary conditions is self-adjoint with respect to the weight function w[x] 
on the interval a < x < b and let € represent a source point within that interval. The 
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Figure 7.2. G,[x, €] for the vibrating string with k = 37/2 and/ = 1 (arbitrary units). 
response of a Sturm—Liouville system to a point source is described by a Green func- 
tion G,[x, €] that satisfies the differential equation 

(L + Aw[x])G[x, €] == ólx-¿] = (7.181) 


d 
—(p[x]Gi lx, ED + Awlx] — q[x1)G, Lx, $] == Ólx — é] (7.182) 
dx 


248 7 Sturm—Liouville Theory 


SS 
SES: 
NY 
SN 


Figure 7.4. G,[x, €] for the vibrating string with k = 77/2 and / = 1 (arbitrary units). 


with the same self-adjoint boundary conditions. Here A is treated as a fixed parameter. Let 
[p, [x]) represent a complete orthonormal set of eigenfunctions that satisfy 


(L +2, wlxDe, [x] == (7.183) 


b 
(Ln |w | Om? = Ön > f Pnl Pm [xIwLx] dx = De (7.184) 
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subject to appropriate boundary conditions. If we substitute an eigenfunction expansion of 
the Green function 


Gls El = ce, 1x] (7.185) 


n 


into its defining differential equation, we obtain 


Y e, (L + Awg, Ex] = dix — El (7.186) 


n 


This can be simplified considerably by using the Sturm—Liouville equation for ọ,„[x] to 
write 


> c, (A — A, why, [x] == Ó[x — é] (7.187) 
Provided that A does not coincide with any of the eigenvalues {A,}, we can isolate the 
coefficient c,, by multiplying both sides by y,,[x]* and integrating over the interval (a, b) 
to obtain 


b 
f 3 C (A AO AF, law] dx == pp lié] (7.188) 
such that 
_ OmplóY 
A+ An = Cm = =A. (7.189) 


Therefore, the Green function takes the form 


Lele, 161 


142, => Grl1, Él = >) an A (7.190) 


where eigenfunctions (,, with A, near A tend to contribute most strongly. As a sanity check, 
we apply the operator £ + Aw to both sides 


(L+ wkd ybs E = Y AL + awo, 


AG 
= Y e =A 
a ald (7.191) 


=>) whey, boe, (T° 
= 0lx E] 


and recover the completeness relation expressed as an eigenfunction expansion of the delta 
function. 

Notice that the reciprocity condition, G,[x, €] = G,[€, x], is apparent in this represen- 
tation also. It is sometimes convenient to include a symmetric function of (x, €) related 
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to the weight function on the right-hand side of the equation for the Green function. For 
example, in spherical coordinates one might employ 6[r — €]/ré on the right-hand side. 
Other times it is useful to include normalization factors, such as —4z for electrostatics. As 
always, it is better to learn the method and to apply it to each particular problem rather 
than to employ the general formulas derived here. 

Now consider the more general situation in which y[x] satisfies an inhomogeneous 
equation of the form 


(£ + Aw[x])W[x] == wx] fix] (7.192) 


where the boundary conditions for y[x] make L self-adjoint with respect to w[x] on the 
interval (a, b). It is useful to expand both f and y in terms of the eigenfunctions {y,,} of £ 
according to 


b 
fol =D fed = f, =, lw f) = f 6, xl" fixlwix] dx (7.193) 
b 
val =p, Ll = Y, =, 1w1y) = Í p ixl yixwix] dx (1.194) 
such that 
Y y, L + we, ll = weno] (7.195) 


Use of the eigenfunction equation then gives 


Lg, 12] = -Awale => Y 4,02, whey, 121 = WaS fi? (7.196) 
and we project the expansion coefficient Y, by multiplying both sides by ¿,, [x]* and inte- 


grating over (a, b) to obtain 


A fa 
A-A,  A-A 


b 
o eee f 6, LF flx]w[x] dx = (1.197) 


n 


where the coefficient of y, should be recognized as the overlap (inner product) of the 
driving term f[x] with the normal mode y,,. This result can be represented in the form of a 
convolution integral 


b 
fut p lEI flélwlél dé 


(7.198) 


b 
A +à, = yh] = f Gala, él flélwlél dé = Y” 


where G,[x, $] describes the response at x produced by a source at €. Notice that it is 
not necessary to add a solution to the homogeneous equation because the Green function 
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already incorporates the boundary conditions. To verify this solution, we simply apply the 
operator (£ + Aw) to both sides 


b 
(£ + Aw>xDYLx] = f (E + Awlx)G Lx El [été] dé 
a (7.199) 


b 
: f Sx — EI flElwlél dé = fiwe 


and recover the original equation. Therefore, the Green function G,[x, €] offers a simple 
and insightful solution to practically any inhomogeneous equation of this kind provided 
that A is not one of the eigenvalues of £. 

More care is needed when A does match one of the eigenvalues because the Green 
function does not exist under those circumstances. If A is close to one of the eigenvalues, 
such that A = A,,, the eigenfunction expansion can be approximated by a single dominant 
term 


m? 


Ax A,, = Ylx] = — [x] + BLx] (7.200) 
that diverges as A > A,, and a smooth background contribution. In fact, for fixed x it is 
useful to think of y as a function of A. The eigenvalues of £ then correspond to the poles 
of y with residues f,,y,,[x]. Often these poles represent resonances or normal modes of 
the dynamical system and the coefficients f, represent the coupling of the driving term to 
those resonances; in other words, f, measures the strength with which the driving term can 
excite mode n. The nonresonant background 


Le A, = Bl] =D) 5 h 


nm ™ n 


Lx] (7.201) 


represents the contributions from all other more distant resonances and is usually much 
weaker. The response of the system is very strong near its resonances unless the driving 
term is orthogonal to the normal mode, such that f,, = 0. Thus, if A = A,, and f,, = 0, we 
can write a formal solution 

Sn 


A= Aw (Ln | f) =0 => yix] = Cm Pl] + » A —A 


nm ™ 


p, [x] (7.202) 


where c,, is an arbitrary constant. However, solutions of this type are inherently unstable 
and probably not very useful because we cannot determine c,, or keep it constant — any 
perturbation of the physical system or any numerical error in the mathematical computa- 
tion, no matter how small, would permit an uncontrollably large contamination by ¢,, [x]. 
A small perturbation ôf yields a small but finite overlap € 

En 


A= An 


frftof, xà, Sows p,, [x] (7.203) 


where 


b 
En = f Oj A Of lx]wlx] dx (7.204) 


a 
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The values of e, or A,, often exhibit small but random fluctuations in time, leading to large 
unstable fluctuations of the output when A is nearly resonant. Alternatively, numerical 
methods may amplify round-off errors when A is near an eigenvalue. Therefore, although 
the eigenfunction expansion of the Green function provides a valuable formal solution, 
considerable care must be exercised near any of the eigenvalues. 


7.4.3 Example: Vibrating String 


The Green function for a vibrating string clamped at both ends satisfies 


P 
(5 + a)atx é] = 6[x-—€],  G[0, €] == G[/, €] == 0 (7.205) 
x 
The eigenfunctions for the corresponding homogeneous equation 
d 
(5 + 3 pplx]=0,  ,10] = ¢,[/] = 0 (7.206) 
are 
N nn 
vy, [x] = ie Sin{k, x], k, = T (7.207) 


with eigenvalues A, = k2. Thus, the eigenfunction expansion of the Green function takes 
the form 


AAC J Sin[k,x] Sin[k,,¢] 
G,lx, $] = A-A = 1 K-K 
n=0 n n 


(7.208) 
n=0 


It is not immediately obvious that this result is consistent with our earlier findings using 
interface matching, Eq. (7.172). To demonstrate equivalence, we perform an eigenfunction 
expansion of the previous result. It is convenient to factor out the normalization factors, 
such that 
JE 1 
G, [x €] = 7 Sc, Sin[k,x] with c, = f Sin[k,, x]G,[x, $] dx (7.209) 
n=1 0 
The integrals can be evaluated by hand using various trigonometric identities in a tedious 
but straightforward manner, with a result 


Sin[k (£ - 1)] 
k Sin[k /] 
Sin[k £] 

k Sin[k 7] 

Simpli fy [#, n € Integers] & 


Integrate [sin [774] Sin[k x], {x, 0, ex] + 


Integrate [Sin [Ex] Sin[k (x-1)],4x,€, 12] // 


1? Sin [259] 
-n?n + 272 


that agrees with the present result. (Don’t try this at home!) 
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7.5 Perturbation Theory 


Often one wishes to investigate the effect of a small change in the dynamics or one can 
divide a differential operator into a solvable part and a small correction or perturbation. 
Suppose that 


ARETA (7.210) 


where £,, represents a dominant and solvable part while £, represents a correction scaled 
by the small parameter e. Let {g©} represent a complete set of eigenfunctions for Lo 
satisfying 
(Lo + Aw EDPO = == () (7.211) 
(ge | w 1g) = (7.212) 


Ôn, n 


and let {p„} represent the eigenfunctions for the complete system, such that 
(L + Aweke, = 0 = (£, + A, wlxDop, = -eL 9, (7.213) 


Treating € £, p, as an inhomogeneous term, this equation can be solved, at least formally, 
by using the Green function for £5, such that 


b 
p, [x] = -f cV Ix, ElL Elp, 16] dé (7.214) 
where 
OAOE 
(Ly + Ala DGA Lo El = die El => Gu Lx, El = Y Pm a l (7.215) 


m Am 


. (0) . A . 
expresses the unperturbed Green function GU in terms of unperturbed eigenfunctions po 
and eigenvalues A), Note that we assume implicitly that £} shifts the eigenvalues slightly 
so that A, — A œ e and does not vanish. We can now write this formal solution as 


(e | Li 1¢,) 
gle] =-€ >) ol Ex] et (7.216) 
where 
(go? IL 19.) = i gO Ll Lig, [x] dx (7.217) 


represents a matrix element of £, that couples q, to gp, However, this formal result is 
not especially useful because y,, appears in the right-hand side as part of an infinite series 
of integrals that we cannot evaluate without already having the solution we seek. Nor do 
we know A, yet. 
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The fundamental assumption of perturbation theory is that changes in the eigenvalues 
and eigenfunctions can be developed as power series with respect to the small parameter e, 
such that 


00 00 


A Co", A OP (7.218) 


m=0 m=0 


Thus, the first-order eigenfunction can be expanded in terms of unperturbed eigenfunctions 
as 


Pr =o +e) co? (7.219) 
m+n 
such that 
e> 0 = p, > 9 (1.220) 


provided that the expansion coefficients c 
convenient to employ the normalization 


(0 we.) =1 (7.221) 


and to exclude y©) from the correction term; attempting to maintain unit normalization 
leads to unnecessary complications. Substituting this approximation into the eigenvalue 
equation gives 


„m remain finite, as expected. Notice that it is 


(Ly + eL, +A,whx)y, = 0 = (7.222) 
(EL, +A, -Away ~ -e> (EL, + A, — AP whee? (7.223) 
m+n 
or 
Li + ADD =- X DA, AD wd (7.224) 
m+n 


where terms of order e? have been dropped. Multiplying both sides by y and integrating, 
we obtain the first-order correction to the eigenvalue 


AD = ye | L 1 ge) (7.225) 


in terms of a diagonal matrix element of £, that uses the unperturbed eigenfunctions. This 
can be evaluated because the {yg} are presumed known. Similarly, we substitute the first 
approximations for the eigenfunctions and eigenvalues into the formal solution to obtain 


(0) 
pk [x] 
ce x rl Ka (ye 1£, 1g) +e > Ope’ 1 L£,1¢)] (7.226) 


m+n kn ^n k mn 
Once again we drop higher-order terms and use orthogonality to isolate the coefficient 
© | L ly®) (OL, le 

ay — Ym Lile (0) y oem |L leh 

m +n => Ch, mo 10 _ 10 > Pn +€ Pm 40) QO 10 


m n m+n m 


(1.227) 


in a form that is also calculable directly. 
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It should be clear that this procedure can be iterated indefinitely, substituting the cur- 
rent approximation back into the formal solutions to obtain corrections to the eigenfunc- 
tions and eigenvalues to one higher power of e. However, the procedure becomes more 
complicated if some of the unperturbed eigenvalues are degenerate because the denomina- 
tors involving differences between eigenvalues will vanish. The problem can be handled 
by forming, in degenerate subspaces, linear combinations that are diagonal with respect 
to £,, but we will not pursue degenerate perturbation theory here. 


7.5.1 Example: Bead at Center of a String 


Suppose that a string of length L under tension 7 with mass density o carries a small bead 
of mass m at its center. The normal-mode equation takes the form 


(4 ae” ky (1 + ox g 5)) ¢, Lx] =0, ¢,[0] = ¢,[L] == 0 (7.228) 


where u = m/o is a small parameter with dimensions of length; hence, we apply pertur- 
bation theory when u << L. Obviously, we identify the unperturbed solutions as 


2 
(5 + a O[x]=0, 9, [0] = p,[L] = 0 > (0 lx] = ie Sin[kKOx] (7.229) 


d 2 
where 
O-o |e (7.230) 
n n T L 
The first-order corrections to the eigenvalues are given by 
2 2 2 
Ko? = (es (0) yx = A gP) = -F sin) 7 | sE = Modin, 2] (1.231) 


where Mod[n, 2] is the remainder for n divided by 2 and has the values of 1 for odd n and 
0 for even n. Notice that the bead does not affect the frequencies for even n because those 
eigenfunctions have a node at L/2 anyway — if the bead is stationary, it might as well be 
absent. Combining with the zeroth-order eigenvalue and expanding with respect to u, we 
find 


k, = M - Ë Mod{n, 2] (1.232) 


Similarly, the perturbed eigenfunctions take the form 


dl _L 
` o y xO) (gO E [x Boae. go ye 2u T, Pl (7.233) 


m+n Mn m+n 
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Figure 7.5. 0, [x] for bead at x = L/2 in units where L = 1. 


or 


y _\(m+n)/2 Mod i 2 
P, = 9 + É Modin, 21 Y yoo adie) (7.234) 
T 


m-n 


mn 


and are unchanged for even n. Notice that the admixture of terms with m + n is pro- 
portional to (m — ny, so that nearby terms contribute most; nevertheless, this expansion 
converges relatively slowly. 

To illustrate the effects of this perturbation, it is useful to express our result in the form 


2 
y,lx] = ¿OL + Ep, [x] (1.235) 


Figure 7.5 displays the lowest several óp,[x] in units where L = 1. These functions are 
qualitatively similar to Green functions with central cusps produced by the small bead, but 
exist for k = k®. 


7.6 Variational Methods 


At the beginning of this chapter we derived the Sturm—Liouville equation using a varia- 
tional argument and here we would like to extend this approach to a variational analysis of 
its eigenvalues. Consider the functional 


le (pts a) £ gests?) dx 


Aly] = - 
f? WoPwtl dx 


(7.236) 
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where w is any continuous function that complies with appropriate self-adjoint boundary 
conditions and where p[x] and w[x] are nonnegative in a < x < b. This functional is 
known as the Rayleigh quotient. Since we have shown that both the real and imaginary 
components satisfy the same Sturm—Liouville equation, we may assume without loss of 
generality that y[x] is real and not use complex conjugates in the definition of A; this will 
simplify the algebra. First, we demonstrate that functions y that render A[y] stationary 
with respect to small variations w[x] > W[x] + 6W[x] consistent with the boundary condi- 
tions are eigenfunctions of the Sturm—Liouville operator. Recognizing that 

io aa PARA (7.237) 

B B? B 


and that B is positive-definite, we require 


OA = 0 => ôA — AÓB = (7.238) 


The variation of the numerator 


d doi 
5A =2 T (» ba ERI) -o + gles) dx (1.239) 


can be simplified using partial integration 


104 = (pri (“E ovt i) =| (Era ED - aora) byte ax (7.240) 


dx 


and the self-adjoint boundary conditions 


(rox a outs x) =0 (7.241) 
to write 

SA = -2 { pe - (pix 5) - glx] dx (7.242) 
The variation of the denominator 

ôB = of W[x]ow[x]w[x] dx (7.243) 
is already i Combining these terms now gives 

5A - ASB =0 = f (El (p EE) - gra + Ata) arta dx = 

(1.244) 


If this integral is to vanish for arbitrary 04[x], the coefficient of ôy[x] must vanish identi- 
cally, such that 


$ (pix ai 


=| — qlxly[x] + Aw[x]W[x] == 0 (7.245) 


258 7 Sturm—Liouville Theory 


Thus, if we identify A[y] with the eigenvalue A we recover the Sturm—Liouville eigenvalue 
equation 


dy, [Lx] 


X 


d 
E (p EE) ASS (7.246) 


Therefore, if A[y,,] is stationary with respect to small variations Y, > Y, + dw then y, [x] 
is an eigenfunction of the Sturm—Liouville operator with eigenvalue A, = A[y,,]- 
Suppose that 


00 


yla] => 0,6, 11] (7.247) 


n=1 


is expanded in a complete orthonormal set of eigenfunctions (p,,) satisfying 


b 
(Pm lwl py) = f Pmp, [x]wLx] dx = Ó,,, (7.248) 


a 


The numerator of the Rayleigh quotient then takes the form 


b 

d 

S oY + awe yas 
a X 


b 
= A CmEn f (pixle, xy, [x] + qe, [x], [x]) dx (7.249) 


m,n 


and can be simplified using partial integration of the first term and the boundary conditions 
to obtain 


2 . d : 
A 3 Ch = 7 y CmCn f onl Eome, Lx]) - qix], Lx) dx (7.250) 


m,n 


The term in parentheses can be simplified further using the Sturm—Liouville eigenvalue 
equation, leaving 


b 
Ay ==) et f 0, IFA, wLx]¢,,[x]) dx (7.251) 


m,n 


Thus, we find that 


2, 
ATW] = Lu CA, 2 Y l L£ | y) (1.252) 


2... (wlwly) 


where 


d d 
L= ¿Pdo - 4h (7.253) 


is the Sturm—Liouville operator. Therefore, the Rayleigh quotient can be interpreted as the 
average of the eigenvalues weighted by the intensity of the corresponding eigenfunction 
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in y. Here intensity is interpreted as the square of the expansion coefficient and represents 
the contribution of that term to the normalization integral for y, as given by Parseval’s the- 
orem. This result should be familiar from Fourier analysis but applies to any self-adjoint £. 

Our analysis of the interleaving of roots demonstrated that there exists a smallest eigen- 
value; hence, the smallest eigenvalue must represent an absolute minimum for A[y]. Sup- 
pose that W[x] is a trial function that is intended to approximate y,[x]. The trial function 
can be formally expanded in terms of eigenfunctions as 


Wx] = pi] + 0,6, Lx] (7.254) 


n=2 


where each a, is small if y is a decent approximation to y,. Notice that we are free to use 


(lwly=l (7.255) 


because A[W] is independent of normalization. Then 


A Nah 
Aly] = A Lasa taa y 2, + Y ala, - (7.256) 
1+ Dña a n > 


demonstrates that the error in the approximation to A, due to the error (y — y,) is second- 
order with respect to a„. Thus, even if the trial function is not especially good, the estimate 
A, = Aly] may still be pretty accurate. Furthermore, the true value is smaller than that 
obtained with any trial function (other than ¢, itself). Therefore, if we construct a trial 
function that resembles p, based upon intuition and which includes a parameter, mini- 
mization of A[y] with respect to that parameter should produce a good approximation 
to A,. Naturally, a better trial function provides a better estimate of A,. This procedure is 
known as the Rayleigh—Ritz method. 


7.6.1 Example: Vibrating String 


The almost universal first example of this technique is the vibrating string. Although we 
already know the eigenvalues and eigenfunctions for 


d 
(5 + 4) Pa =0, ¢,[0] == ¢,[L] == 0 (7.257) 
with Rayleigh quotient 
L 
y [xP dx 
Aly] = y Abra (7.258) 
h vl? dx 


suppose that we feign ignorance and employ a zero-parameter trial solution of the form 


yix] = x(L — x) (7.259) 
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which satisfies the boundary conditions and is symmetric with respect to L/2, as expected 
for the lowest eigenfunction. Thus, 


E [E - 2) dx 10 


ps x (L -xf dx OP 


W' [x] =L- 2x = Aly] 


(7.260) 


is only 1.3 % larger than the exact answer of 71? /L?. A better approximation can be obtained 
by including a parameter. Consider, for example, a trial function of the form 


L 2 
yix] =x(L-x) (1+2(x- =) ; 
Although the evaluation of A is straightforward, we will use MATHEMATICA to perform that 
unenlightening task 


, 2 
ae Integrate [y Be , Tx, 0, L}] //Simpli fy 
Integrate [y[x]?, {x, 0, LY] 


6 (560 + 56L%a + 111*a?) 
L? (336 + 241%a + L“a?) 


and then minimize A with respect to the parameter a, to obtain 


sol = Solve [3,4 = 0, a] 


NETA 


13L? 13L? 


Å J acid /N 
x 


1.00001 10.348 
{ E TAE } 
Obviously one of the solutions corresponds to a maximum and should be rejected, while 
the other brings us within one part in 10° of the exact answer. Of course, a poor choice of 
trial function, such as the addition of a term that is odd with respect to the midpoint, would 
bring little or no improvement in the variational estimate. By respecting the symmetry of 
the problem, this judicious choice of trial function offers a very accurate estimate. 
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1. Laguerre polynomials 

Suppose that L, [x] is a polynomial of order n and that polynomials of different order are 
orthogonal with respect to the exponential weight function w[x] = e”* on the interval 
0 < x < oo, such that 


f L IL, Le” dx = 5, n (7.261) 
0 
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a) Construct the lowest three Laguerre polynomials using orthonormality conditions. 


b) Alternatively, show that these functions satisfy a Sturm—Liouville equation that can 
be expressed in the form 


xy” [x] + glx]y’[x] + A, yx] == 0 (7.262) 


and demonstrate that the eigenvalue must be a positive integer for the corresponding 
eigenfunction to be bounded as x > ov. 


2. Minimization of mean-square error 
Suppose that a piecewise smooth function f[x] is approximated by the truncated expansion 


N 


feed = > esp, hd (7.263) 


m=0 


where the eigenfunctions {y,,} are orthonormal with respect to the weight function w[x] on 
the interval a < x < b, such that 


b 
f Pm "O, Lx]wLx] dx = ôn p (7.264) 


Show that the mean-square error 


b 
él, = a Ile] = fybPwhx] dx (7.265) 


is minimized when 


b 
Cm = f o LJ" fix]w[x] dx (7.266) 


independent of N. Note that this independence of the optimum coefficients c,, from the 
number of terms N is a special property of eigenfunction expansions that would not be 
satisfied if the coefficients of arbitrary functions, such as x”, were determined by least- 
squares fitting. (Hint: consider independent variations of c,, and c;,.) 


3. Intermediate boundary conditions 
Show that a regular Sturm—Liouville operator (with pla] + O and p[b] + 0) is self-adjoint 
in a < x < b for any nontrivial intermediate boundary conditions of the form 


ay fla] + a, fla] == 0 (7.267) 
Bof [bh] + B, f'Tb] = 0 (7.268) 


where the coefficients «œ; and £; for i = 0, | are real. 
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4. Eigenfunctions for string with intermediate boundary conditions 
The eigenfunctions for a vibrating string with intermediate boundary conditions satisfy 


A 
(= + 4) p, [x] == 0 (7.269) 
aop, [0] + a, p,, 10] = 0 (7.270) 
Bob, lL] + Pp, [E] = 0 (7.271) 


Derive a transcendental equation for the eigenvalues k,, and discuss a graphical procedure 
for determining numerical values. 


5. Sturm-Liouville Wronskian 
Suppose that y, and y, are two solutions to the Sturm—Liouville equations 


(£ + Aw[x])y, [x] = 0 (7.272) 
(£ + Aw[x])y,[x] = 0 (7.273) 
where 
d d 
£= T (a) — qlx] (7.274) 


Evaluate their Wronskian 


Wix] = y, bely be] — yayi [x] (7.275) 
and show that p[x]W [x] is constant. (Hint: evaluate W’[x].) 


6. Bounds on eigenvalues 

Consider the eigenvalue problem 
(1 +.2x°)y” [x] +20 [x] + ACL +1?2)y[x] == 0 (7.276) 
y'[0]==0, y[1]=0 (7.277) 


in the interval 0 < x < 1. Show that the lowest eigenvalue is in the range ai <A,< in 


7. Green function for vibrating string 
The Green function for a vibrating string clamped at both ends satisfies 


a J J $ i 

(5 +1J6.1 = 6[x—-x'], G,[0,x"] == G,[/, x] == 0 (7.278) 

a) Write down the formal eigenfunction expansion of G,[x, x’]. Under what conditions 
does the Green function exist? 


b) Demonstrate that the Green function can also be expressed in closed form as 
_ Sin[kx_] Sin[X(x, — 2] 


oe) k Sin[k/] 


(7.279) 


where x_ is the smaller and x, is the larger of x and x’. 
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c) Prove that these representations are equivalent. 


8. Green function for y” — k?y == f[x] with y[0] = y[L] = 0 
Consider the differential equation 


y'al- yb] = fix], yi0] = yL] = 0 (7.280) 
where k > 0. 
a) Construct a closed form for the Green function and write a general form for the solu- 


tion to the inhomogeneous equation. 


b) Construct the Green function as a Fourier sine series and again write a general form 
for the solution to the inhomogeneous equation. 


c) Demonstrate that these two representations are equivalent. 


9. Green function for stopped pipe 
Acoustical vibrations in a stopped pipe satisfy 


yl + yl] = fle], y'0]=0, yiL] = (7.281) 
a) Construct a closed form for the Green function and write a general form for the solu- 


tion of the inhomogeneous equation. 


b) Construct the Green function as an eigenfunction series and again write a general form 
for the solution of the inhomogeneous equation. 


c) Demonstrate that these representations are equivalent. 


10. Green function for open pipe 
Acoustical vibrations in an open pipe satisfy 


y "ld + yl] = fla, y'[0] = y [L] = 0 (7.282) 


a) Construct a closed form for the Green function and write a general form for the solu- 
tion of the inhomogeneous equation. 


b) Construct the Green function as an eigenfunction series and again write a general form 
for the solution of the inhomogeneous equation. 


c) Demonstrate that these representations are equivalent. 


11. Green function for periodic orbits 
Suppose that small-amplitude oscillations around a stable orbit in a synchrotron satisfy a 
differential equation of the form 


Y” [x] + wy[x] == fix], 0O<x<2x (7.283) 
subject to periodic boundary conditions 
y[0] = y[2x],  y'[0] = y'[27] (7.284) 


where f[x] is a localized disturbance. 
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a) Obtain the eigenvalues w,, and eigenfunctions ¢,,[x] for the homogeneous equation 
and write a formal expansion for its Green function. 


b) Obtain an explicit closed-form solution for the Green function. 

c) Write a formal expression for the solution to the inhomogeneous equation. 
d) Discuss the behavior of the system near a resonance with w = w,- 

12. Fredholm alternative theorem 


Suppose that you have in your possession the eigenvalues and eigenvectors {A> p,, [x]) that 
satisfy a homogeneous Sturm—Liouville problem of the form 


d d 
(L t A, WADE, 1x1 = 0, £ = q bd) E qlx] (7.285) 

x dx 

and wish to solve the inhomogeneous equation 
(L + A, WExDylx] = flx] (1.286) 


by expanding y in terms of y,,. The boundary conditions are homogeneous but for our 
present purposes need not be specified further. What condition must f satisfy for this 
strategy to succeed? Write a formal solution assuming that f satisfies your condition. What 
happens if f fails to satisfy this condition? The general solution to this problem is called 
the Fredholm alternative theorem. 


13. Green function for a 4th order equation 
Consider a linear fourth-order differential equation of the form 


E 
(= + ata =0, y10] =y"[0] = yin] =y"[1] = (7.287) 


a) Show that this equation is self-adjoint for the specified boundary conditions. 


b) Obtain the normalized eigenfunctions ¢,,[x] for 0 < x < 7 and corresponding eigen- 
values A,,. 


c) Construct an eigenfunction expansion for the Green function that satisfies 
d+ 
(4 + a) G [x x’] == Ó[x — x’] (7.288) 
x 


with the same boundary conditions. 


d) Use the Green function to produce a series solution to the inhomogeneous equation 


dł 
E + a) ots ==x,  y[0] = y”[0] = yir] == y” [r] == 0 (1.289) 
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14. Shifting the eigenvalue spectrum 
Suppose that 


d d 
L = Pibd a = (£, + a, DYE] = 0 en) 


has an eigenvalue spectrum (a,,n = 0,00) with a + 0. Construct a related Sturm- 
Liouville operator 


d d 
L, = T = qo[x] => (L£ + B wlx DY Lx] = 0 (7.291) 


that has the same eigenfunctions and a spectrum of eigenvalues (6,,) that is shifted with 
respect to {œ} such that 6) = 0. How are {p,, q», w») related to {p}, q w1}? 


15. An eigenvalue problem with mixed boundary conditions 
Show that the eigenvalue problem 


y"+Ay=0, y[0]=0, y'[0] =y[1] (7.292) 


with mixed boundary conditions has only one real eigenvalue and determine the corre- 
sponding eigenfunction. Why does this problem not have an denumerably infinite spec- 
trum of solutions? 


16. Normal modes for stretched string with one end attached to spring 

Suppose that a string of length L under tension T has one end fixed while the other is 
attached to a ring that slides without friction on a fixed rod and that the ring is attached to 
a spring with spring constant œ. Thus, the transverse displacement y[x, t] satisfies 


e A Be ra o Leg. O Pen = px, t] (1.293) 
where p is the mass density. 
a) Show that there exist normal modes of the form 
yix, t] = e f[k,x] (7.294) 


and determine the relationship between w, and the parameters of the problem (L, T, 
a, p). Discuss the special cases of very weak and very strong springs. 


b) Demonstrate that the normal modes are orthogonal. 


17. Hanging string 

A string of length L with uniform mass density o hangs under its own weight. Here we 
investigate small-amplitude transverse vibrations in a plane, assuming that the tension 7[x] 
is not affected by this motion. 


a) Evaluate the equilibrium tension T[x] and construct the normal-mode equation. What 
are the appropriate boundary conditions? 
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b) Use the substitution s? = 1 — x/L to show that the normal modes satisfy the Bessel 
equation and find an expression for the eigenfrequencies. 


c) Construct the general solution to the initial-value problem. 


18. Twirling string 

A string of length L with uniform mass density o is attached at one end to a thin rod 
that rotates with constant angular velocity Q. Here we investigate small-amplitude vertical 
vibrations neglecting gravity and assuming that the tension T[x] is not affected by trans- 
verse vibrations. 


a) Evaluate the equilibrium tension t[x] and construct the normal-mode equation. What 
are the appropriate boundary conditions? 


b) Show that the normal modes satisfy the Legendre equation and find an expression 
for the eigenfrequencies. Determine the lowest three eigenfrequencies and sketch the 
corresponding eigenfunctions. 


c) Construct the general solution to the initial-value problem. 


19. Clamped string with central bead 

Suppose that a string of length L under uniform tension 7 is clamped at both ends and that 
the mass density o is uniform except for a small bead of mass m affixed to its center. It 
is convenient to define the range as —a < x < a where a = L/2 and to define u = m/c. 
Neglect gravity. 


a) Construct the unnormalized eigenfunctions and deduce the equations satisfied by the 
eigenfrequencies. Compare with the ordinary string (m = 0) and explain why sym- 
metric and antisymmetric modes behave differently. 


b) Describe a graphical procedure for determining the eigenfrequencies for symmetric 
modes. Use this construction to compare eigenfrequencies for symmetric and anti- 
symmetric modes for large u or for large k. Derive a simple approximation for the 
splitting between symmetric and antisymmetric modes under these conditions. 


c) Sketch the first few symmetric eigenfunctions and evaluate the displacement of the 
central bead. 


d) Verify explicitly that the eigenfunctions are orthogonal with respect to 
w[x] = 1 + ó[x] (7.295) 
where u = m/o has dimensions of length. 


20. An upper bound on the first root of J [x] 
Evaluate the Rayleigh quotient A[@] for the eigenvalue problem 


y+ ~ +Ay==0, |y[0]| <0, yi] = (7.296) 


using the test function [x] = a(1 — x) and deduce an upper limit for the smallest root 
of JoLx]. 


8 Legendre and Bessel Functions 


Abstract. The properties of Legendre and Bessel functions are developed in consid- 
erable detail, including: generating functions, recursion relations, orthonormality, 
series and integral representations. These functions are important for a wide variety 
of physics problems based upon equations featuring the Laplacian operator and will 
play a central role in our subsequent study of boundary-value problems. 


8.1 Introduction 


Many of the special functions we employ in physics arise by applying in appropriate coor- 
dinate systems the technique of separation of variables to reduce partial differential equa- 
tions, such as the Laplace, diffusion, or Schródinger equations, to systems of ordinary 
second-order differential equations. With appropriate boundary conditions these differen- 
tial equations are often self-adjoint and their solutions exhibit the general properties of 
Sturm—Liouville systems. Other properties can be developed using integral representa- 
tions, which permit analytic continuation, or by using generating functions of the form 


Fit, x] = > fb (8.1) 
n=0 


where the coefficient in a power-series expansion with respect to one variable is a function 
of the second variable. 

There is a vast literature on the hundreds of special functions that have been studied 
during the last 300 years or so. Obviously, an exhaustive study is impossible within the 
confines of one chapter of a single-semester course; careers have been devoted to this 
topic. This author is not expert in this subject and is not capable of memorizing all of 
the relationships he employs frequently, much less those he does not, and expects that 
few readers would be willing or able to do so either; it is an important but dry subject. 
Instead, he relies on compendia like Abramowitz and Stegun or classic texts for details 
that are not readily at hand. Nevertheless, it is important to be familiar with several of 
the most important methods for studying such functions and their most important generic 
properties. For this purpose we will concentrate on those functions that are most useful for 
problems in electrodynamics, diffusion, and quantum mechanics at the first-year graduate 
level. Specifically, we will study Legendre and Bessel functions of several types. These 
functions will be used in the chapter on boundary-value problems to solve a variety of 
interesting and important physics problems. Our derivations here will not always provide 
all of the gory details, but some of the exercises will request the missing steps. 
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8.2 Legendre Functions 
8.2.1 Generating Function for Legendre Polynomials 


The Legendre polynomials P, [x] where x = Cos[6] arise often in the representation of the 
dependence of a physical quantity on a polar angle 9. Perhaps the most direct method for 
obtaining these polynomials is through the Green function for the Poisson equation 


y? O oe —An6[F — 7] (8.2) 


PP 


If r > r’, we may expand 


, py 2 -1/2 
l aU -22x+(4} (8.3) 
r 


P-r > r 


as a power series int = r’/r whose coefficients depend upon x = 7-7. Obviously, when r < 
r’ we employ a similar expansion with the roles of r and r’ exchanged. These expansions 
can be unified in the form 

— =r -2+ (8.4) 
Ir=F] 
where r_ = min[r, r’] is the smaller and r, = max[r, r’] is the larger of r and r’ and where 
t = r_/r, is in the range 0 < t < 1. Thus, the generating function 


1 co 
glt x] = 3 = Y Pa" (8.5) 
Vis Wed E 


exhibits a uniformly convergent power series with respect to t whose coefficients P,,[x] are 
identified as Legendre polynomials of degree n with respect to x. Once these functions 
have been constructed, the Green function for the Poisson equation becomes 


1 ar 
= an ” P, [x] (8.6) 
r-7'| 2 qe 
where r- is the smaller and r the larger of r and r’. 

Before proceeding with explicit construction of the Legendre polynomials, we obtain 


some of their basic properties directly from the generating function. Recognizing that 


gli +1] == 22" (8.7) 


a 
+ 


we find 


P [+1] = (+1) (8.8) 
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Similarly, using 
alt, —x] = g[-t, x] = P,[-x] = (-)"P, Lx] (8.9) 


we conclude that the parity of P, is (—)". Finally, using 


0glt, 
gl0,x]=1, el =) = (8.10) 
Ot t=0 
we find results for the two lowest Legendre polynomials 
PS). ax (8.11) 


that are consistent with the properties developed so far and which form the seeds for con- 
struction of a series by recursion. 

We can obtain recursion relations by differentiating the generating function with respect 
to either of its variables. Differentiation with respect to t gives 


Oglt, x] _ 1 
P: n- 
ðt = Bee 57 => [xj 
(8.12) 


00 


>i wð eom ls" => Pst dá 


=0 


where the last step uses the expansion of the generating function to eliminate fractional 
powers. Collecting like powers of t gives 


> "(n+ DP — Qn + 1)xP, + nP,_,) = 0 (8.13) 
n=0 


Notice that the coefficient of P_, vanishes, so that polynomials of negative order do not 
actually occur. If this relationship is to be satisfied for any value of t, the coefficients for 
each power 1” must vanish separately. Therefore, we obtain a three-term recursion relation 
of the form 


(n+ 1)P,,, - Qn+ 1)xP,+nP,_, =0 (8.14) 


that can be used to construct the entire sequence P, given Py = 1, P, = x. This is some- 
times described as a pure recursion relation because it does not involve derivatives. Direct 
calculation provides Table 8.1 and Fig. 8.1. Notice that each P, is a polynomial of order n 
containing only even or odd powers according to its parity. 

A recursion relation for derivatives P’ can be obtained by differentiating the generating 
function with respect to x, such that 


Og{t, x] 
= P! t" t= Pp p 
dx d- mB =>) = eee ¿Y lx] 2 ibe 
(8.15) 
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Table 8.1. Lowest few Legendre polynomials. 


n Plx] 

0 1 

l x 

2 4(-1 +327) 

3 $x(-3 + 5x’) 

4 iB - 30x? + 35x4) 

5 ix(15 — 70x* + 631%) 

6  4(-5 + 105x? — 315x* + 231x6) 

7 Lx(-35+ 3151? — 693x + 42935) 

8 2485 — 1260x? + 6930x* — 12 012x° + 643535) 


P,,[x] 


Figure 8.1. Legendre polynomials. 


Once again collecting like powers of t” we find 


Pai- PA +P =P, (8.16) 


n-1 ~ 
Using the previous recursion relation 


(n+ 1)P,,; — (2n + 1)xP,+nP,_, =0 
= (n + DP, -Qn+ IOP, +P,)+nP,_, =0 


n+l n 


(8.17) 
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we can eliminate P’ to obtain 


7 
Fras Pi 


i= n+ DP, (8.18) 


Many similar relationships can now be obtained by manipulation of these basic recursion 
relations. For example, by solving the simultaneous equations 


P! - P! =(Qn+ DP, (8.19) 

(n+ 1)P,,, +nP,_, = (2n + DP) + P,) (8.20) 
one finds 

Pl, =xP + (n+ DP, (8.21) 

P’_, =xP, —nP, (8.22) 


Multiplying by x and manipulating the index using n > n+ 1 


xP, = LP +nxP,_, (8.23) 
xP! = xX Pl. = (n+ Ps (8.24) 


Piri 


one obtains the so-called ladder relations 


(n+ DP = (n+ DxP, - (1-22) (8.25) 
nP,_, =nxP, + (1 -2)P! (8.26) 


that can be used to develop Legendre functions using either upward or downward recur- 
sion. 

Finally, the recursion relations can be used to derive a second-order differential equa- 
tion for P,[x]. We begin by expressing one of the ladder relations in the form 


(1 -3P = nP 


n-1 


- nxP, (8.27) 
Differentiating this expression 


(1 —x?)P” — 2xP’ = nP’_, — nP, — nxP’ (8.28) 


n-1 n n 


and eliminating P}_1, we finally obtain a second-order differential equation 

(1 — x) P” [x] — 2xP’[x] + n(n + 1)P, [x] = 0 (8.29) 
that is a special case of Legendre’s differential equation 

(1 = ¿ya - 2zy’[z] + vv + Dylal = (8.30) 


that applies when v => n is a nonnegative integer. Therefore, the regular solutions to 
Legendre’s equation for integer v > n and real z > x on the interval [—1, 1] are known as 
Legendre polynomials. More general solutions will be considered later. 

When discussing Legendre functions, it is convenient to use n or / to represent non- 
negative integers, x = Cos[6] to represent real numbers in the range |x| < 1, and to permit 
v or z to represent extensions of the degree or argument to complex numbers. 
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8.2.2 Series Representation and Rodrigues’ Formula 


Using the Taylor series 


£ c (2n)! 
(1 = z) 1/2 = sae (8.31) 
EQ 2" (n!) 
the Legendre generating function becomes 


(2n)! 


1 co 
glt 1] = === = Y > 
Vi-2t+P 2 2°"(n!) 


Then using the binomial expansion, we obtain 


Oey (8.32) 


= (2n)! r n! n=k (_¿23k 
La mY 2 rd m 


= 3 ‘Se (2n)! (2x pan 


2n 
n=0 k=0 2 n!k tn — k)! 


alt, x] = 
(8.33) 


An absolutely convergent double sum of this form can be rearranged using a theorem 
proven in the exercises that states 


o pP co Lr/2] 
2 Ži l4, p4 = >. As r25 (8.34) 
p=0 q=0 r=0 s=0 
where 
reven => [7/2] = 5 (8.35) 
r-1 
rodd = [r/2] = oe (8.36) 
is the floor function. After a little algebra we obtain 
co |m/2] 
(2m — 2k)! Dk 
t,x] = = oe 8.37 
ae 2 2 O Emn- okm =I” BA 


Therefore, identifying the Legendre polynomial P,, with the coefficient of £”, we obtain a 
series representation 
lezi (2n — 2k)! ai 


= yk 
FLX] = 2! A — 26! (8.38) 


of explicit, albeit cumbersome, form. 
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This series can be expressed in the form 


Ln/2] 1 


P [x] = 20 a ma (8.39) 


pin lax) * 


Using k > 5 => 2n — 2k < n, we recognize that n derivatives would eliminate any term 


of the form ee 2k with k > |n/2]. Therefore, we are free to extend the upper limit of the 
summation to n to obtain 


dy" n xk 1 n_n 
Enix] = la -) 2 E lk! Z'n! (=) LO y pe ye 8.40) 


where the final sum is simply the binomial expansion of ES -= 1)”. Therefore, we obtain 
Rodrigues’ formula 


= 1 dY 2 n 
P,[x] = 74) (x* — 1) (8.41) 


for the Legendre polynomials. Notice that the parity P,[—x] = (—)"P,,[x] is obvious using 
Rodrigues’ formula. 


8.2.3 Schlafli’s Integral Representation 


Extending Rodrigues” formula into the complex plane 


1 d n 
P = — | — 2 = 1 i‘ 42 
= (E) -D (8.42) 
and applying the Cauchy integral formula to 
1 t2 -1y 
(2-1) = — $ ( "de (8.43) 
2ri t=Z 


we obtain the Schläfli integral representation 


(2 -1)" 


P 
n [z] = t- (1-3 


(8.44) 


where the contour encloses z. An advantage of this integral representation is that it provides 
an analytic function for positive integer n that reduces to the Legendre polynomials for z > 
xin the range —1 <x < 1. 

8.2.4 Legendre Expansion 


The Legendre equation can be expressed in a form 


(Ela -E)n 1)) PE == () (8.45) 


that is manifestly self-adjoint. Orthogonality between Legendre polynomials is immedi- 
ately evident, but the normalization must still be evaluated. This is simplest using the 
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generating function to evaluate 


1 
1 —2xt +17 


=> Pile ie (8.46) 


n,m 


such that 


1 dx 1 
f = =) 0 f P [x]P [x] dx (8.47) 
112 -1 


n,m 


The integral is obtained by an elementary change of variables 


1 d+? co 2n 
d 1 d 1 l+t t 
[== =x Í = irog "| =2 (8.48) 
al 1-2xt+t 2t day y t 1-t s 2n+1 


where in the last step a power series permits identification of the normalization integral as 


1 
2 
f PLDI n [x] dx = Sate m (8.49) 
1 2n+1 ” 


Therefore, the Sturm—Liouville theorem tell us that a piecewise continuous f[x] can be 
expanded on the interval |x| < 1 in a Legendre series 


00 


fal => a,P, lx] (8.50) 
n=0 
2n+1 ! 
a, = > 1 dx P lx] f[x] (8.51) 


that converges in the mean. Expansions of this type will prove quite useful in solving 
boundary-value problems based upon second-order partial differential equations. 
Of course, one of the most important Legendre expansions 


00 


óx- x] = $ a,P, Lx] (8.52) 


n=0 


provides a representation of the delta function. Using the orthogonality relation, the coef- 
ficients are simply 


2n+1 f! 2n+1 
a, = L f dx P,[x]é[x — x’] = L P [x] (8.53) 
-1 


n 


Therefore, we obtain the completeness relation 


ólx-x]= Y mi TEPI (8.54) 
n=0 
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8.2.5 Associated Legendre Functions 


Separating the Laplacian operator in spherical polar coordinates, one obtains an equation 
of the form 


E e [(Sintal y) t+) 


Sin[6] do J [Cos[6]] = 0 (8.55) 


nm 
Sin[9]? 
in terms of the polar angle 6. Thus, defining x = Cos[6] we obtain an associated Legendre 
equation 


( n È d 
0-5 -xE + (104 


m2 
a a 7 >) pts =0 (8.56) 


that is similar to the ordinary Legendre equation except for the additional contribution 
proportional to m?. Rather than attempting a frontal assault, it is sufficient to demonstrate 
that given a solution y,[x] to the ordinary Legendre equation, one can generate a solution 
to the associated Legendre equation using a transformation 


d m 
Yml = (1 an) y, [x] (8.57) 


that is motivated by Rodrigues” formula. If we write the Legendre equation for a specific / 
as 


(1-19 = 2xy +10 + Dy =0 (8.58) 
where y“ denotes the n derivative, and differentiate the equation once, we obtain 

(d = x*)y = 4xy® + (10 + 1) - 2)y = 0 (8.59) 
A second differentiation then gives 

(1 = x*)y — 6xy + (10 + 1) - 6)? = 0 (8.60) 
It is useful to define u, = y such that 

(1 — Ju, — 2(m + 1)xu’, + (10 + 1) — m(m + Du, = (8.61) 


is valid for O < m < 2. Assuming that the equation is valid for a particular m and differen- 
tiating again, we obtain 


(1 =U — 20m + 2)xu + (10 +1) — (m + Dn + 2))u? = 0 (8.62) 


Un 


1) 
and use u% = 1511) to rewrite this equation as 


ad- Au — 2nxuy, + (a +1)-n(n+ DJu, ==0 (8.63) 


where n = m + 1. Therefore, by induction, the second-order differential equation applies 
to any u,, with 0 < m < /. However, this equation is not yet in the desired form and is not 
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self-adjoint. The final step is to define 
W =1- => ==", (8.64) 
and to show that v,, satisfies a self-adjoint equation. Allowing MATHEMATICA to evaluate 
the derivatives 
ulx_] = (1 -x°)™?v[x]; 
w [x] 
mx(1-x?)*2v[x] + (1 - x?) ?v [x] 
u” [x] / / Simpli fy 
(1-2) (m (1 + (1 +m)x*) v[x] + (-1 + x’) (-2m xv’ [x]+ (-1 + x’) v” [x])) 
and substituting into the differential equation for u,, 


A= (1-x2)u” [x] - 2x(m+1)u’ [x] + (101 +1) -m(m+1))ulx] / / Simpli fy 


m 


(1-x2)+2 (- (m? +(-1+ x’) 1(1+1)) vix] + (-1 +x?) (2x v'[x] + (-1+ x’) v” [x])) 
A(1-x?)"/2/ / Simpli fy 
(m? + (-1 +x?) 1(1+1)) vlx] - (-1+x?) (2x v' [x] + (-1+x?) v” [x]) 
-1+x? 

we conclude that y,„ does indeed satisfy the associated Legendre equation for 0 < m < /if 
y, Satisfies the ordinary Legendre equation. 

The associated Legendre polynomials can now be expressed in terms of Rodrigues’ 
formula as 


1 d 1 5 i Mg = y d l+m ; y 

a O 

or 
m 2\m/2 dy” 
P mnb] = EPA 74] Pl] (8.66) 
i dx 

where the Condon—Shortley phase (-)" proves convenient for quantum mechanics. Note 
that P, olx] = P,[x]. Here / is denoted the degree and m the order of P, „. Using Rodrigues’ 
formula we immediately find 

P, nl] = (YP [0] (8.67) 
and 

m +0 => P nll] =0 (8.68) 


Although our derivation assumed that m is nonnegative, one can show that the generalized 
Rodrigues’ formula offers a valid solution any integer in the range —/ < m < 1. However, 
the solutions for negative m are related to those for positive m according to 


(J —m)! 

P 8.69 
qa mi erll en 
and thus are not independent and we do not obtain a normal completeness relation. Never- 


theless, the solutions for negative m will be useful for spherical harmonics in which the ¢ 
dependence distinguishes the sign of m. 


P, mlx] = Y" 
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Some attention must be given to the phase convention for associated Legendre poly- 
nomials. Many authors of textbooks on mathematical physics, such as Arfken or Butkov, 
omit the factor of (—)” in the relationship between P, and P,,, on the grounds that it is an 
unnecessary complication at this stage. However, this phase is convenient for applications 
of spherical harmonics in quantum mechanics, which are based upon associated Legendre 
functions. Therefore, other authors of recent textbooks, like Lea, do include this factor 
in P, m. Hence, it is incumbent upon the reader to be cognizant of the conventions chosen 
by the author. (Caveat emptor!) We have chosen to include the (—)” phase for three practi- 
cal reasons: (1) it is used by the most useful compilation of mathematical formulas, that by 
Abramowitz and Stegun; (2) my favorite mathematical software, MATHEMATICS’, also uses 
this phase in its definition of associated Legendre functions; and (3) it provides a more 
natural extension to complex arguments. 

The orthogonality properties for associated Legendre functions can also be developed 
by using Rodrigues’ formula to write 


1 y l+m 
Í P, mlx] Py m [x] dx = T f (= ey (6 A (d i ey) 


d +m Se 
x (5) (1 =-x%) Ja: (8.70) 


where it is important to note that m is the same for both Legendre functions. We assume, 
without loss of generality, that / < /’ and integrate by parts /’ + m times to obtain 


1 141 I'+m ) +m 
S PunleDPe glad ds = 7 — fa- ey (4 


ATA] 
23m d y tm 241 
x i e) AS (l-x y) dx (8.71) 


where the integrated terms vanish at both limits. The integrand can be simplified using 
Leibnitz’s expansion of multiple derivatives of a product 


d ‘ ` n n-m m 
(E) cen =) (7) gD] (8.72) 


m=0 


in terms of binomial coefficients and products of derivatives. Thus, we obtain 


+m 


oer eet) SOIT 


k=0 
l+m+k 
(ÉS a-y) (8.73) 


Notice that because (1 — x?)¥ has only 2u nonvanishing derivatives, nonvanishing terms 
require 
l +m-k<2m 


=> <], -ms<k<l-m (8.74) 
l+m+k<2] 
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Yet we assumed / < /’. Therefore, this integral vanishes for /’ + 7, demonstrating the 
orthogonality of associated Legendre functions with the same m but different / values. 
Furthermore, the normalization for / = /’ can be obtained from the single term with k = 7 — 
m, such that 


ii l+m 2m 
f ( P mlX x dx = a (7: a m a el ña a - y) 
-1 


d 21 j 
(a (l-x y)as (8.75) 


E > +M) emen KESI 
(2!) N 


Although the remaining integral can be evaluated most simply as a finite sum by expanding 
the integrand, more clever techniques are needed to obtain the simple closed-form result 


ey 
i (1 — x’) dx = GDI (8.76) 


and are relegated to the problems at the end of the chapter. Collecting these results, we 
finally obtained the orthogonality integral 


5 _ 2 (+m)! 
Í P, ym Pp, nlx x] dx E (21 + 1) Em (8.77) 


Having two indices, the associated Legendre functions exhibit a wider variety of recur- 
sion relations than the ordinary Legendre functions. Some recursion relations vary the 
degree, others the order, and many others vary both. Among these are 


2x 
P, m21] + Gn + DTZ Z Pring LX] + (a + 1) - m(m + 1) P mlx] == 0 (8.78) 
Pi, E nai E (8.79) 
ld T (21 + Da = AP n ~ Pirim = 0 (8.80) 
(U—=m+ DP ¿579 — I+ DxP m + OMP aim = 9 (8.81) 


but we leave the derivations as exercises. Note that the choice of phase affects some of the 
signs in these recursion relations. A simple method for deriving recursion relations for m = 
0 is to begin either with the differential equation or with one of the recursion relations for 
Legendre polynomials, differentiate m times, and multiply by a power of (1 — x2)'2 to 
produce an analogous relation for associated Legendre functions. With further analysis 
one can show that the same relations apply under more general conditions, complex x > z 
and nonintegral or even complex /, m > y, 4. 

Rather than develop further properties of associated Legendre functions, I prefer to 
proceed directly to the spherical harmonics in which the ¢ dependence distinguishes the 
sign of m. 
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8.2.6 Spherical Harmonics 


For problems in three dimensions it is convenient to define angular basis functions with a 
simple orthonormality with respect to solid angle, such that 


2r 1 
f Yin VE mw [°] dû = T ii Yim [0, Yr n [0, $] dcos Odo = 61.1 Om m (8.82) 
0 -1 


The dependence of the spherical harmonics Y,,,[6,¢] upon the polar angle is obviously 
related to the associated Legendre functions while the dependence upon the azimuthal 
angle is proportional to Fourier functions e*™”*. Thus, we define 


27+1(/-—m)! 
47 (1+m)! 


1/2 . 
Y, m0, 9] = ) P, [cos O]e*" (8.83) 


where the ugly factor provides the desired normalization. Using the standard definition 


for P, „ With m < 0, we obtain the conjugation property 
Yin 0, 91 = "Y, 18 0] (8.84) 


Similarly, under inversion of the coordinate system 


P> F—>0>28M-0, 677+ (8.85) 
and using 

P nix] = HP ll, Explim(¢ + 2)] = (-)” Explimg] (8.86) 
to write 

Y,,, [7-07 +6] = Y Y,,, 10, 6] (8.87) 


we identify the parity 
Y, m LA = (VY, „n LP] (8.88) 
as (-Y. It is also useful to know that 


21 1 1/2 
= P,[cos 0] 5,0 (8.89) 


Y, 10, 0] = ( 


g] 1 1/2 
E ) m,0 (8.90) 


Vin EE] = Y,„10,0) = (= 


The degree / is usually related to orbital angular momentum while the order m is the 
projection of the total angular momentum on the polar axis. 
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8.2.7 Multipole Expansion 


Recognizing that the Legendre functions are complete for the polar angle while the trigono- 
metric functions are complete for the azimuthal angle, we expect the spherical harmonics 
to provide a complete orthonormal basis for functions defined on the unit sphere, such that 


EY y FY E (8.91) 


1=0 m=-1 


fa = ia 


2r 
-f L FIO, $1Y, „l0, J" dcos 0 do 


= f f: FIO, OY; „lO, I" Sin[0] 49 do (8.92) 
o Jo 
where the coefficients f, „ are constants. Suppose that the angular delta function 
Sl? — 7] = d[cos 0 — cos O ló[ó — ¢’] = y y A; ml? IY, „IP (8.93) 
1=0 m=-1 


is expanded with respect to unprimed variables using expansion coefficients which are 
functions of the primed variables, here treated as parameters. Using the orthogonality prop- 
erties, the expansion coefficients become 


Ajnl?] = f or — PY, 171" dO = Y, [PT (8.94) 


Therefore, the completeness relation for spherical harmonics takes the form 


o0 


1 
= YY, ib nl? T= Y inln] (8.95) 


1=0 m=-1 l,m 


where the final sum uses a short-hand notation that indicates summation with respect to 
both variables over their natural ranges, namely 0 < 7 < œ, -I<m < 1. 

Notice that it does not matter which of the spherical harmonics in the completeness 
relation is conjugated because this combination is real. The summation over m resembles 
the summation over components used to evaluate the scalar product of two vectors. In fact, 
Y, „ represents a component of a spherical tensor of rank / and this summation represents 
a scalar product between two spherical tensors of the same rank. Thus, it is convenient to 
define a scalar product of the form 


Y[?]-¥,[7"] = y il (8.96) 


m=-] 


and to express the completeness relation in the more compact form 


ól- F] = X YIP: YF (8.97) 
I 
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An interesting current application of multipole expansions is to angular variations of 
the cosmic microwave background. The relic microwave background is approximately 
uniform, with an average temperature of 2.72 K, but small fluctuations carry information 
about the expansion of the early universe and can be used to test theories of inflation. Thus, 
one writes 


1 
TIo] = > T,nX10,61, A= IE? (8.98) 
I 


m=-1 


where 474, represents the average temperature, A, is related to the motion of the sun 
relative to the preferred reference frame, and higher multipole strengths carry the desired 
cosmological information. 

Next consider a scalar function of 7 expanded in the form 


00 1 
Ms» > 404 (8.99) 


1=0 m=-1 


do f ri, ter do (8.100) 


where the coefficients are radial functions. For example, if f[r] represents a density, 
the expansion coefficients would be described as multipole densities. If f is real, such 
that 


F =F => Sim = Oh (8.101) 


we find that the multipole densities share the conjugation properties of spherical harmon- 
ics. 


8.2.8 Addition Theorem 


Suppose that two vectors, 7 and 7’, are described by polar and azimuthal angles (6, ġ) and 
(8', $”) and that y represents the angle between them, such that cosy = 7? - 7. Using the 
completeness of spherical harmonics, we can expand P,[cos y] in terms Y, ,,[6, 6] according 
to 


co I’ 


P[cos y] = y A Ay 10, Oy ml, 6] (8.102) 


'=0 m=-1 


where the expansion coefficients 


Ayy 16, $] = f Y, „l6, ØP [cosy] d9 (8.103) 
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depend upon the other pair of angles. Recognizing that P,[cos y] is real and using the 
conjugation property of spherical harmonics 


00 E 


P,[cos y] = Plcos y!" = Y” Y, Ap „lo, 917,10, 6] 


’=0 m=-1 


oo r 
= 9, VOM nl, PYY, 1B 6] (8.104) 
’=0 m=-1 
00 £ 
=), 9, "Ar 10, 6 YY, 10, $] 
1=0 m=-1 


we observe that the coefficients 


* 


Arm (9, o] = "Ar n 10, 9] (8.105) 
share the same conjugation property. (Orthogonality of spherical harmonics ensures that 
this relation applies term by term.) Furthermore, 7 - 7” is symmetric with respect to the 
exchange of primed and unprimed coordinates. Therefore, the expansion should take the 
form 


a 


o0 


= y y By mYr mlO, PY m g, oT (8.106) 


l =0 m=-1 


where the coefficients are independent of angles and this combination of spherical har- 
monics is real. Finally, because P[? - 7] is a scalar function its value must be independent 
of the orientation of the coordinate system. Suppose that we choose a coordinate system 
in which ? = 2 such that 


w r 21 +1 1/2 
PF ale P,[cos 0] = = 2 y By. mYr, mlO, a ) 5,0 
=0 m=-1 
(8.107) 


= -> By m (42 >) P, [cos 0] 


The orthogonality of Legendre functions limits the rhs to the single term with /’ = 7. 
Therefore, we obtain the addition theorem 


1 


A N An 17* 
Pl? - r ] = Wei | y Y, ml?) Y, ml? 7] (8.108) 
m=-l 


or, using the scalar product, 


> YF) yP] (8.109) 
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The addition theorem proves invaluable in a wide variety of problems. Some of the 
most important examples are based upon the multipole expansion of 
1 Avr 
==> Pl -P] (8.110) 
Ir -—7'| oats 


which we can now express in the form 


1 “ 4r o a 
ADES TTRI: YP] (8.111) 
1=0 > 


8.2.8.1 Example: Far-Field Solution to Poisson’s Equation 


The electrostatic potential [7] around a charge distribution p[7] satisfies Poisson’s equa- 
tion 


Vt] = -4r pl] = yl] = T GÈ, PPPI Er (8.112) 


where the Green function for a point charge is simply 


ee ee ee eee y m T YI -Y,[”] (8.113) 


-r| i 2+1 
such that 
wir] = f a la dy’ (8.114) 
P= Fl 
The multipole expansion of the charge density takes the form 
pL => Pintat (8.115) 
l,m 
Pimlr] = f PUY, ml?] dO (8.116) 


Suppose that r is much larger than any r’ with appreciable charge density. A far-field solu- 
tion for the electrostatic potential of a localized charge distribution can then be expanded 
as 


4 
T limy iş] (8.117) 


r>r =y >, aion 
l,m 


where the spherical multipole moments q,,, are given by 


Dim = Í rY, „lP dr = f r dr (8.118) 


Recognizing that the contributions of higher multipole moments are suppressed by increas- 
ing powers of r, one expects that it is sufficient for very large r to employ only the lowest 
few multipoles. Therefore, the multipole expansion provides a very efficient solution for 
asymptotically large distances. 
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8.2.9 Legendre Functions of the Second Kind 


Generally a second-order differential equation must possess two linearly independent solu- 
tions, but our anticipated application to the polar angle dependence of physics problems 
with x = Cos[6] in the range —1 < x < 1 selects P, [x] as the most relevant of the two solu- 
tions. The (sometimes) implicit requirement of analyticity at x = +1 is actually a boundary 
condition that forces the eigenvalue in the differential equation 


(1 — xy” [x] — 2xy’ [x] + Ay[x] = (8.119) 


to assume the values A = n(n+ 1) where n is an integer and selects the corresponding eigen- 
functions P, [x] that are regular at the singular points of the differential equation (where the 
coefficient of y” vanishes). However, if the polar angle is limited to a more restricted 
range |x| < a < 1, the second solution is required. Linear homogeneous boundary condi- 
tions at the smaller angular limits still yield an eigenvalue problem, but the eigenvalues are 
more general and the eigenfunctions are linear combinations of the two solutions. 

We apply the Frobenius method to construct solutions from power series about the 
origin. Let 


st ag? (8.120) 


where the appropriate value of r has yet to be determined. Direct substitution gives 


y a,((n + r)(n r- DO 2072 — Ln + ra + Ax") =0 (8.121) 
n=0 

or 
3 ap(n+r+2)(n+ r+ Dx = > a,((n + rin +r + 1) A)" (8.122) 
n=2 n=0 


The left-hand side has two powers that are absent from the right, namely n e {—2, —1), but 
the coefficients of both vanish if we choose r = 0. In order that this equation be satisfied 
for all x within the radius of convergence, we equate the coefficients for like powers of x 
to obtain a recursion relation 
an2  mnt+1)-A 
a, ~ (n+2)(n+1) 


(8.123) 


that determines two solutions, one for even and another for odd n. Therefore, the general 
solution takes the form 


2 4 6 
yx] = 4,(1 -a3 -6-07 - (6-920 -M5 =>] 
e x5 x! 
+a, (++ 2-03 +0 -12- 15 +(- (12-060 - 05 +] (8.124) 


where a, and a, are arbitrary. 
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Convergence is a key issue. The ratio test 


lim 


n>00 


s2 po] i= Ke (8.125) 
a, 

informs us that solutions for arbitrary A diverge on the unit circle. (Although the ratio test 
is technically inconclusive at |x| = 1, the integral test is conclusively divergent.) However, 
divergence is avoided when A = /(/ + 1) for nonnegative integer / causes the power series to 
terminate in a polynomial of order / that is identified as the familiar Legendre polynomi- 
als P,[x]. If 7 is even, the function associated with a, is proportional to P, while the function 
associated with a, diverges at x = +1, whereas if / is odd a, produces P, while a) produces 
a divergent function. The Legendre functions of the second kind, Q,[x], are identified with 
the divergent series except for a conventional normalization yet to be specified. 

The lowest few Q, are illustrated in Fig. 8.2. Obviously, these functions become more 
oscillatory as / increases and have parity O,[—x] = C) o k] opposite that of the corre- 
sponding P,. For integer order one can factor the logarithmic divergence at the end points 
from polynomial portions to obtain 


WA Ode 


7 La (2m + 1)! = m) 


1 1 
ola = 5Ple] Log | Pial] (8.126) 


l-z 


Closer analysis shows that P,[+1] = 1 for all v, but P,[x] diverges logarithmically at 
x > —1 for v + /. However, O,[x] exhibits the opposite behavior: it is divergent at both 
singular points of the differential equation, x = +1, for v = / but is finite atx = —1 
for v = (2/ + 1)/2. 


8.2.10 Relationship to Hypergeometric Functions 
With suitable variable transformations, the hypergeometric equation 
t0 = tu’ [t] + (y- (@ +8 + Dt)u'[t] - aut] = 
> y[t] = AF[a, B, y, t] + Bt! F[1 -y +a, 1 -y+ß,2-y,t] (8.127) 


contains as special cases many of the second-order differential equations of interest to 
mathematical physics. Using standard methods, one can show that the power series 


Fla, B, y, t] = (Ma (8.128) 
n=0 O, n! 
where 
(a), = 1 (8.129) 
(a), = aa+1)---(@+n—-1) (8.130) 


converges for |t| < 1. Furthermore, when a or £ is a nonpositive integer, the series termi- 
nates and the hypergeometric function F reduces to a polynomial. Although neither time 
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Figure 8.2. Legendre functions of second kind. 


nor space permits full development here of the theory of hypergeometric functions, which 
can be found in more advanced texts, it is still of interest to express the Legendre functions 
in terms of hypergeometric functions. 

Consider the variable transformation 


t=30-2 w(t] =-2y'lel, ul] =4y"Lel, 1-1) = 50-2) (8.131) 


such that 
(1 -2)y"[z] + (-2y + (@ +8 + DU -2)y"[z] - apylz] = 0 (8.132) 
By choosing 
a+B+1-2y==0 (8.133) 
a+B+l1=2 =y=1, a=-1, fP=v+1 (8.134) 
aß == —v(v+1) (8.135) 
we obtain Legendre’s equation 
(1 = 2y" E] = 22y [2] + v0 + Dyle] = (8.136) 
and identify 
[=z 
k-11<2 = Pi] =F 1141, l, > | (8.137) 
Pid = yr] +L lp (8.138) 
2 TU Neal in EE 
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as the Legendre function of the first kind in term of a Taylor series around z = 1 with the 
conventional normalization P,[1] = 1. Alternatively, the substitutions 
2 v+2 v+1 v+3 


t=zx", a= 7 a p= 2 ` y= 2 (8.139) 


also transform the hypergeometric equation into Legendre’s equation, permitting the 
Legendre functions of the second kind to be identified as 


yr Tlv+1] uE v+1l v+3 =| 
ear 2 a 


oie (2-1)? Tiv+u+1] 
Vil <1 = grt! zed Tlv+ 3 


k> 1 = 0,[z] = (8.140) 


v+u+2 v+u+1 v+3 _, 
2 , 2 > rf 
(8.141) 


el" F | 


when vis not a negative integer. The first factor is made single-valued by a cut on (—oo, +1) 
and the normalization is conventional. Definitions for larger regions can be obtained by 
analytic continuation, but we refer the reader to standard references for further details. 


8.2.11 Analytic Structure of Legendre Functions 


The most general form of the associated Legendre equation is 


2 
(1-2)y"[z] - 2zy'[z] + (o rije A > 


E = 
= yiz] = AP,,,[z] + BQ, ,[z] (8.142) 


where {z, v, u} are permitted to be complex. Most of our results for Legendre functions of 
real arguments in the range —1 < x < 1 can be extended to the unit disk |z| < 1 simply 
by replacing x > z in series representations, recursion relations, and other formulas even 
if the degree v or order u is permitted to be complex. However, for arbitrary z one must 
pay close attention to the choice of branch cuts. The most common, although not unique, 
choices are listed in Table 8.2. Each is then an entire function of z in the cut plane, and 
customary values on the real axis for —1 < x < 1 can be defined using suitable averages of 
values above and below the cut where € > 0*. 


Table 8.2. Branch cuts for Legendre functions. 


Function Cut Value for=1<x<l 


Plz] (oo, -1) P,lx] 

0,Lz] (oo, +1) Q,lx] = (O, [x + ie] + O, lx — ie])/2 

P iz]  (=œ,+1) P,,[x] = Explinu/2]P,, [x + ie] 

Qiz] (+o,+D  0,,[x]=Expl-izul(Expl-imu/210,,[x + ie] + Explizu/210,, [x — ie])/2 
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If we define 


Sy El AP, Jel + 82,12] (8.143) 


where A, B are constant coefficients independent of {y, u, z}, we find that the recursion 
relations 


2 

Pomo + Ut ey? ru ~ (VY + 1) pe DD) fy == 0 (8.144) 
a 

La tra 1)(2 = 1)? Ahi 20 (8.145) 

Fari OV 4 De = y? Fig ~ A (8.146) 

(v-pt Di eis — (2v + Deva +(v + cia == 0 (8.147) 


appear slightly different but that the previous results for z > x with —1 < x < 1 will be 
recovered if (z? — 1)? is also handled with a cut on (—co, +1) in a consistent manner. 
Similarly, the Rodrigues formulas now take the form 


pal = (E) Plc] = Patol = ay (<P 6.148) 


m 


z] = (72 — 1ym/2 dy” — (_\y1 — y2m/2 d 
O, ula = (2-1) (=) 0,1 = O, 1x1 = "4 - 2) (=) 0,[x] (8.149) 


for nonnegative integer m. These results offer another justification for retention of the (—)” 
phase when the argument x = Cos[6] is in the range (—1, 1). 
Generalizing the Schláfli integral representation of P,[z], one can show that 


ei =a ESO y 8.150 

Mya E 
al (2 - 1)” 

Q, Iz] (8.151) 


~ 4iSin[va] Je, C-D" 


provide solutions to Legendre’s differential equation for arbitrary n > v, but the contours 
for noninteger v must be chosen carefully because the integrands have branch points at t = 
—1, z, +1. Suitable contours are sketched in Fig. 8.3. Legendre functions of the first kind 
are obtained using contour C} that encloses both ¢ = z and ft = 1 with one cut below the 
real axis for f < —1 and a second between t = 1 and £ = z. After integration with respect 
to £, the cut between 1 = +1 and z becomes irrelevant but P,[z] is left with a branch point at 
z = —1 for noninteger v and a cut below the negative real axis from there to —co. With these 
choices we find that P,[1] = 1 for any v. Alternatively, the bow tie contour C, provides a 
related integral representation for O,[z] . For C,, z must be outside both loops so that the 
phase changes, incurred by circumnavigating the branch points, cancel. A more detailed 
analysis shows that a single-valued definition of Q, requires a cut from z = +1 to z = —oo. 
Similar integral representations can also be constructed for associated Legendre functions, 
but we will be content to stop here. 
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Figure 8.3. Contours for integral representations of Legendre functions. 


Finally, one can also show that, for fixed z, P, is an entire function of v while Q, 
is a meromorphic function of y with simple poles at negative integers. This property of 
Legendre functions is useful in quantum scattering theory where significant insights can 
be obtained by treating the orbital angular momentum as a complex variable. More detailed 
development of the properties of Legendre functions of the second kind can be obtained, 
when needed, from standard references. These developments are omitted here because we 
will not find much use for these functions in the remainder of the course. Suffice it to say 
that most properties can be derived from the Schláfli integral representation. 


8.3 Bessel Functions 
8.3.1 Cylindrical 


Variants of Bessel’s differential equation arise frequently when the method of separation 
of variables is applied in cylindrical or spherical coordinates to partial differential equa- 
tions based upon the Laplacian operator. However, it is instructive to begin our study of 
these functions using the generating function instead of the differential equation, similar to 
the approach used for Legendre functions. The generating function for cylindrical Bessel 
functions of the first kind, J, [x], takes the form of a Laurent expansion with respect to t 


awn 


glt, x] = Exp|5( ¢-rd|= > Je (8.152) 


n=-—00 
whose coefficients are the desired functions of x. Bessel functions with negative n are 
related to those with positive n using the symmetry 


gl, x] = git, x] = J_,[x] = (J, x] (8.153) 
Similarly, we obtain the reflection symmetry 

gl-t, =x] = alt, x] = J,[-*] = ("J [x] (8.154) 
such that 

J_,[-x] = J,[x] (8.155) 


for integer n. 
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Some of the basic recursion relations can be obtained, as before, by differentiating 
glt, x] with respect to either variable. First consider 


Og{t, x] 
Ot 


=" + 1) Exp| že -r D) = y nJ [x]! (8.156) 


n=—0 
and substitute the generating function to write 


00 


x = n n= 
sae +1) >. J, [x]t” = 5 nJ Je! 


n=—oo n=—oo 


= > ice mar DJ, 1)" =0 (8.157) 


n=-00 


If this series is to vanish for arbitrary t, the coefficients must vanish separately for each 
term. Adjusting the indexing, we obtain the three-term recursion relation 


hbd] = 3), Pd + 95,1 Ex] (8.158) 


Similarly, differentiation with respect to x 


MA o Y) pbl Y) Sa id: 


n=-00 n=- 


provides a recursion relation 
2V [x] = J, bd] -4,,,, lx] (8.160) 


for the derivative, almost by inspection. Manipulation of these recursion relations provides 
several additional relationships: 


IMAGES (8.161) 
Ln y, bd) = 0,2, La] (8.162) 
dx 


A power series for J, is obtained by factoring the generating function and expanding 
each factor according to 


e E) 
DS 


n,m=0 


(8.163) 


The coefficients for each nonnegative power of t are identified as 


n>20>J,[x] = y E (a. (8.164) 


= (n+ m)!m! \2 


8.3 Bessel Functions 291 


J y 1x] 


X 
Figure 8.4. Bessel functions of first kind. 
or 
n=0 =J= (5) > A AN (8.165) 
7 oe (n+m)!m!\ 4 


m=0 


where the leading power has been factored out and the series written as an even function 
of x. Similarly, for negative powers we obtain 


J = x ee (3 (8.166) 


== (m—n)!m! \2 


However, because the coefficients vanish unless m > n, we use m > n + k to write 


E A (SJ pxyn+2k oe 
J] = 2 on (5) = Je] = Jx] (8.167) 


and again conclude that changing the sign of the index does not yield an independent 
function. One can show that the power series is convergent for all x, even if it is inefficient 
for large x. 

Figure 8.4 displays the first few Bessel functions of the first kind. The functions oscil- 
late with slowly decreasing amplitudes. Their roots are interleaved, as expected for solu- 
tions of a Sturm—Liouville system. 

Notice that elf, z] is an analytic function of either variable, except for the essential 
singularity at £ = 0. Thus, 


co 


Jel A E (8.168) 


tm! 
pe (n+ m)!m! \2 
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is an analytic function whose radius of convergence is infinite. We can generalize this 
series 


zv 00 1 2 m 
Jiz] = 5) ere: 1]Fim + 1] (5) eee 


to arbitrary v using a branch point at the origin for noninteger v and intepreting the factori- 
als as gamma functions. The cut is normally taken below the negative real axis. However, 
the symmetry between positive and negative integer order n does not apply for noninte- 
ger v. Near the origin one finds 


Y 
J,[z = Tiv + 17 (=) (8.170) 
2 
provided that y is not a negative integer. Recognizing that the first n — 1 coefficients of the 
power series vanish when y = —n, we find 


Ja = (-',[z] x Tin + 17°! (-5) (8.171) 


as expected. 
It is now a simple matter to verify that the basic recursion relations 


4d = Fal + Jl (8.172) 
2F,[z] = J ak] — J,,;[z] (8.173) 


continue to apply for noninteger v and complex z. The differential equation satisfied by 
J,[z] can be deduced from the recursion relations (or from the series). Suppose that £_[z] 
is a family of functions that satisfy the basic recursion relations 


2vf,[z] = zyl] + flh (8.174) 
2f,[z1 = £, ¡[21 — fyi [2] (8.175) 


but are not necessarily limited to cylindrical Bessel functions of the first kind. From these 
one deduces 


furl = Ae- fi (8.176) 
fala = AS + fill (8.177) 
such that 


¿fa = zf, ¡Edy = € el-ze O- faketo DeO (8.178) 
or 


¿fi d- ¿fla —zf,slzl + 7+ efile] = 0 (8.179) 
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Substituting 
zf = v- DA, ¡Ll -24 [2 (8.180) 
zf kl = vfl + zhi (8.181) 

such that 

¿fla =e- 1- 2) fe +0- DL (8.182) 


one obtains 


PRE- (p0 - D-2) +- D) - fle) + RED 
+ (v+ Dzfi[z] = 0 (8.183) 
Therefore, we finally obtain Bessel’s differential equation 
¿f d+ efile) + (2-1 = (8.184) 


for which J_[z] are solutions that are regular at the origin for integer v. Since this is a 
second-order differential equation, there must exist a second linearly independent set of 
solutions that also satisfy the same recursion relations. 

It is important to recognize that the same recursion relations apply to any solution of 
Bessel’s equation or to any linear combination of solutions with the same v, provided that 
the coefficients are independent of both v and z. Thus, the recursion relations 


2 
fll + fll = “pA (8.185) 
f,-ilz] — fa] = 2AE] (8.186) 
d 
dz 94,12) = #2*” Feil] (8.187) 


are general properties of cylindrical Bessel functions. (We leave it as an exercise to the 
reader to derive the third formula from the first two.) 

If we divide by z, Bessel’s equation appears to be self-adjoint with respect to the weight 
function z provided that we use appropriate boundary conditions. If we assume that v = 0 
(the usual situation), the requirement that the solution be finite at the origin selects J, while 
requiring either a node or a vanishing derivative at a fixed radius R provides an eigenvalue 
equation for either Dirichlet or Neumann boundary conditions. Thus, it is useful to change 


variables such that 
P 


sger (e - a}flEl=0, flO] finite = fle «J, (8.188) 
where the boundary conditions 

Dirichlet: f[R] = 0 = J,[k,,é] = 0 (8.189) 

Neumann: f’[R] == 0 = J)[k, ,€] == 0 (8.190) 


yield sets of eigenfunctions indexed by the positive integer n. It is now a simple matter, 
following the procedures of the Sturm—Liouville chapter, to demonstrate orthogonality 
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between two solutions with the same v but different eigenvalues. Therefore, we obtain 
eigenfunction expansions of the form 


FIé] = 2,404 + [ky nf] (8.191) 


with the aid of the orthonormality relations 


Jk, ,R] = 0 >f J lk, „él, lk, „lé dé = A (8.192) 


y E 2 
Ji[k,,R]=0= [ Ie EN KEDE dE = > 5h 7 (5) Jott Sinn 
(8.193) 


that are developed in the exercises. The appropriate choice of basis depends upon the 
boundary conditions. Note that summation is performed over n for fixed v = 0. The appro- 
priate value of v is determined by other aspects of the problem, such as angular momentum. 
An important special case is the limit R > oo for which the spacing between eigenvalues 
becomes infinitesimal. With careful attention to the limit, one can prove that 


Ó[k — k’] 
Vkk’ 
olg =$] 
vee’ 
for v = 0. The right-hand side is the appropriate delta function for cylindrical coordinates 


with a linear weight factor; here the denominator is written in a symmetric form and bal- 
ances the weight factor such that 


i dé Ef{e) “= 


The details are left to the exercises. 
The generating function provides a Schláfli integral representation 


dra = Bait- r] Y) tar > jel = [e] 


n=-—00 


f J [KEV [ké] dé = (8.194) 


f S J [KEV [ké lk dk = (8.195) 
0 


= flé'] (8.196) 


(8.197) 


where the contour encloses the origin in a positive sense. For a circular contour, one obtains 
Bessel’s integral 


i 1 27 
t =e” => J |z] = = a Expli(z Sin[6] — n0)] 49 (8.198) 
0 
or, more simply, 
1 TT 
Jnlz] = = f Cos|z Sin[6] — n6] do (8.199) 
0 


where z may be complex but n is integral. 
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8.3.2 Hankel Functions 


Hankel functions are obtained by generalizing Bessel’s integral representation to noninte- 
ger order v. Consider 


ol z 11] dt 
fel= zg f Exp| 50-1 ) al (8.200) 


where, except for avoiding the origin, we leave the integration path arbitrary for the moment. 
If we apply Bessel’s differential operator to both sides 


Lf Tl + <a + (2-1 


1 7 E L1 2 2 Z 1] dt 
== (Fe: pe ear Jer at Exp[5(-1 | (8.201) 


the resulting integrand is a perfect differential, such that 


Z friz] + zf] + (2 - Y) fa = > ¿ARS z] (8.202) 
where 
Et 2] = r” (Fe +r) +y) Exp| 5 ( =] (8.203) 


and where A indicates the difference between values at the endpoints of integration. When 
v > nis an integer, F, [t, z] is single-valued and the right-hand side vanishes for any closed 
path; hence, f, is a solution to Bessel’s equation under those conditions. (Paths which do 
not enclose the origin, though, degenerate to the trivial solution f, > 0.) When v is not 
integral, we require a branch cut to interpret the integrated term. It is customary to cut 
the t-plane below the negative real axis. We must also choose a contour for which F[t, z] 
has the same value at both ends for any z within a usefully large domain. The simplest 
method is to choose semi-infinite contours that exploit the limiting properties 


t>0* => Filt,z] ~ (=) Exp |-E| — 0 for Relz] > 0 (8.204) 
Sy zt 
t > -œ => F ft z]~t "3 Exp B — 0 for Re[z] >0 (8.205) 


Thus, the two contours shown in Fig. 8.5 provide integral representations of the Hankel 
functions 


1 z -1,] dt 
Rel] > 0 = Ai] = — [ exp[zo -t y] a (8.206) 
where the normalization is conventional. Similar representations for Im[z] < 0 can be 


constructed by choosing a different cut for £ and rotating the contours accordingly, as 
detailed in the exercises. The Hankel functions satisfy the same recursion relations as the 
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In{t] 


Re[r] 


Figure 8.5. Contours for integral representation of Hankel functions. 


Bessel functions because they are solutions to the same differential equation derived from 
those recursion relations. 

Note that the same integrand provides two independent solutions to Bessel’s equation, 
H[z] and H[z], depending upon whether the negative real axis is approached from 
above or from below. These solutions are distinct even for integral v, despite the fact that 
the cut is not needed, because neither contour is closed. Thus, the ordinary Bessel func- 
tion, J,[z], must be a linear combination of these Hankel functions. For simplicity, suppose 
that v > n actually is an integer. Recognizing that the portions of the two contours that lie 
along, or infinitesimally close to, the positive real axis are traversed in opposite directions 
and cancel, the two contours can be joined to form a closed contour for H® [z] + H® [z] 
that can be deformed to that used for Bessel’s integral representation of J,[z], namely a 
closed contour enclosing the origin in a counterclockwise sense. Thus, we obtain 


J [z] = (80d + HL (8.207) 


and expect this same relationship to apply continuously to nonintegral v. After all, Bessel 
functions for fixed z can be interpreted as continuous functions of v even if that argument 
is customarily represented as a subscript instead of as a second variable. Alternatively, one 
can develop a power series by expanding the factor of Exp[—z/2r] and using the integral 
representation of the I function to obtain the coefficients for powers of z, and obtaining 
thereby the known series for J,[z]. Therefore, 


3,2] = $A [z] + HP (8.208) 


provides a definition for arbitrary v and, more importantly, an integral representation 


dt 


pal 


as 
2 


Relz] > 0 => J,[z] = n f Exp| t- r')| (8.209) 
E 


based upon the contour in Fig. 8.6. 
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Figure 8.6. Contour for J,[z]. 


The Hankel functions and their spherical analogs are especially useful for analyzing 
scattering solutions in the far field. Using the method of steepest descent, one finds 


HOW] z pew Exp li (z - Qv+ DZ) (8.210) 


Haje pew Exp [-i(< -(Qv+ Do) (8.211) 


for Re[z] > 0. (This exercise is left to the student as an opportunity to refresh his/her skills 
with the method of steepest descent.) Thus, we identify 


H[ké] Exp[-iwt] ~ = Expli(ké — wt)] Exp [in + D7] (8.212) 


as an outgoing cylindrical wave associated with the angular function P, [Cos{6]] and 


HO [kë] Exp[-iwr] = | 7 Exp[-i(ké + wt)] Exp [an + D7] (8.213) 


as the corresponding incoming wave for large radius €. Here k is the wave number and 
the asymptotic condition requires k >> 1 and modest n; for large n a somewhat more 
sophisticated analysis may be needed to obtain the required accuracy. 


8.3.3 Neumann Functions 


Having obtained two independent solutions, the Hankel functions, and identifying Bessel 
functions of the first kind as a linear combination of Hankel functions, we can now define 
Bessel functions of the second kind as the orthogonal linear combination 


N [z] = A - Ha) (8.214) 


where this choice of phase yields Neumann functions. (These functions are often labeled Y, 
in the literature and sometimes have different phase conventions.) The Neumann functions 


298 8 Legendre and Bessel Functions 


obey the same recursion relations as Bessel and Hankel functions. From the asymptotic 
behaviors 


Ja ~ pee“ Cos|z -(2v+ D7] (8.215) 
T 4 


N [z] = Pen Sin|z - (2v + D7] (8.216) 


we recognize the Bessel functions as the cylindrical analogs of trigonometric functions. 

Alternatively, we can define Neumann functions as linear combinations of J, and J_,, 
which are independent solutions for nonintegral v. To determine the appropriate coeffi- 
cients, we use 


Ja = 3401 + HO [z)) (8.217) 


and analyze 


1 d 
Hiz] = = fe 0) at (8.218) 


The substitution 


+in tin 


t=—, dt=-> 
S se 


git 1 


ds = H iz] = | Exp|5(=s")] as 
a 
(8.219) 


preserves the integrand up to a phase. In order to preserve the contours also, we must 
choose the upper sign for H“ and the lower sign for 49. Thus, we obtain the symmetries 


AO) = e™ HPE] (8.220) 
HO [z] = e™™H® [z] (8.221) 
such that 
Ja = 34011 + HP [z] (8.222) 
Ja = 4(e™ H” [z] + eH [z] (8.223) 
requires 
HP [z] = i Csc[vr] (eJ, [2] - 3,2) (8.224) 
HÊ [z] = —i Csc[vr] (e "4 [z] - J_,[z]) (8.225) 


for nonintegral v. Finally, substituting these results into the definition for N,, we obtain 
N iz] = Cot[vx]J_[z] — Csc[va]J_,[z] (8.226) 


after a little algebra. This is the most common definition for Neumann functions, but 
appears to be singular for integer v. However, under those circumstances we can apply 
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1”Hópital”s rule to the limit y > n to obtain a result 


1. (ðJ, al 
N,[z] = = lim i i - St) 


227 
dl ðv ðv een) 


T von 


that is continuous with respect to v. 
We now have the means to analyze the behavior of N,[z] near the origin, where 


zv 00 1 2 m 

Jil = (5) 2, Iiv +m + ll [m+ 1] (- =) wee 
nev co 1 2 m 

J_lz] = (5) 2 Ti-v+m>+1]F[m+ 1] (- 4 ee?) 


provide the necessary expansions. Thus, 


, a 2 m 
N,[z] = Cotvr](5 Val J 


m=0 


~ Csefva (5 y 2 Tl-v+m + Lr +1] (- 4 13230) 


for nonintegral v. Although combining these expansions does not seem to offer an attrac- 
tive formula for the coefficients, it is immediately clear that Neumann functions are singu- 
lar at the origin. Hence, it is often sufficient to display only the leading terms. For simplic- 
ity suppose that Re[v] > 0 and that v is nonintegral, such that 


Cse[va] /z)\-” 
Rely] > 0 = Nil = y (5) (8.231) 
Using the identity 
Caro Sali] (8.232) 
we obtain 
AA BN (8.233) 
a \2 


Thus, N, has a branch point at the origin with power-law divergence for v + 0. For the 
special case v = 0, we use 


v 


Oz X 
o z” Log[z] (8.234) 


and the definition of the digamma function 


0 Log[PlzJ] _ T’[z] 


val = ——, Tiz] 


(8.235) 
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to write 
Elo = tim (HA - O E E) (8.236) 
A o A O 
a = ~J.,felLog| 5] + (5) 3 a rm L el (8.238) 


and to obtain 


T “tm el caros] (¿Y °° yin+m+ 1] ( aj 


2) EATin+m+ 1Fim+ JA 4 


ZA = W[-v+m+ 1] 2 m 
- (- ) A T[-v+m-+ IP [m+ 1] (- 4 | (8.239) 


where the final summation requires further attention because both numerator and denomi- 
nator appear to diverge for integer n. We leave the final steps as an exercise for the student 
and quote a final result 


N, lz] = =| 2/, lz] Log | = |- (5 ie > A (=) 
vin+m+1l+yim+11( 2)" 
A J in Tin + m+ Tim + 1] (2) | (8.240) 


in which the coefficients are manifestly finite, even if rather cumbersome. For the present 
purposes it is sufficient to recognize that the divergence of N, near the origin 


2 
Nola] = = Log| =| (8.241) 


is logarithmic. 

Figure 8.7 illustrates the Neumann functions. Aside from the divergence near the ori- 
gin, these functions also oscillate with slowly decreasing amplitude for large x. Note that 
we cannot easily display the values for x < 0 because of the logarithmic contribution. The 
divergence near the origin is logarithmic or stronger: logarithmic for N, or power law (z””) 
for N, with n > 0. 

So why do we need two kinds of Bessel functions? A general solution to a second-order 
differential equation, such as Bessel’s equation, is a linear combination of both kinds. 
Unless the boundary conditions eliminate the Neumann functions by requiring finite values 
at the origin, any expansion based upon eigenfunctions for Bessel’s equation will require 
both regular and irregular solutions. 
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Figure 8.7. Bessel functions of second kind, for n = 0 — 3. 


8.3.4 Modified Bessel Functions 

Modified Bessel functions are solutions to a differential equation 
ld y 

¿ dé ES 


which is related to the ordinary Bessel equation by the substitution k — ik which trans- 
forms a wave equation into a diffusion equation. Consequently, we anticipate solutions of 
the form J [iké] and N, [iké]. It is customary to define modified Bessel functions with the 
phase and normalization conventions 


(Ef TEl) - (e + )ne == 0 => fl€] = AL, [ké] + BK, [ke] (8.242) 


Lal = "Jia (8.243) 
K,[z] = Si HO [iz] = Son (J [z] + iN, [izl) (8.244) 


where the first kind, /,, is regular while the second kind, K,,, is irregular at the origin. 
Figure 8.8 illustrates the first few modified Bessel functions for integer order and positive 
arguments. The phase conventions ensure that both functions are real for positive x. These 
functions are sometimes described as hyperbolic Bessel functions or as Bessel functions of 
imaginary arguments, but the latter is not really an appropriate description because their 
extension to complex arguments provides many of their most important properties (as for 
practically all functions discussed in this course!). These functions do not have nonzero 
real roots and are not mutually orthogonal. 

One can develop all of the important properties of modified Bessel functions using 
straightforward, but often tedious, analysis based upon cylindrical Bessel functions, but in 
a belated attempt for brevity we forgo the details and merely quote some of the results. 
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X 
Figure 8.8. /, solid, K, dashed. 
The basic recursion relations are 
2v 
Lal] - 4a k] = kl (8.245) 
2v 
K, lz] - K,,,[z] = -76L (8.246) 
dI 
haldt “rg = HE (8.247) 
z dz 
dK 
Kail F K, = -2E (8.248) 
Zz dz 
d +y +y 
AS Liz) ==" da (8.249) 
d 
de (Ka) = -z*”K,-[z] (8.250) 


where the sign differences between relationships for 7, and K, arise from the v dependence 
of the conventional phases. The leading asymptotic forms are 
-1/2 


Z € Kiga loe (8.251) 


Van” 2 


while the power series for J, takes the form 


Relz] = 0 = £ [2] = 


2 


zw 0° 1 2 m 
Hz] Ela) al) (8.232) 


m=0 
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Like the Neumann functions, one can show 
T 
K,[z] = 5 Cselva] (1.,[2] - 1,121) (8.253) 


for nonintegral v or 


K [z] = o lim (8.254) 


n von 


ôl [z] aliz] 

ð æ ) 
for integer order. The power series for K, is cumbersome, but we note that the divergence 
near the origin is given by 


TD] /z\” 
Relv] > 0 = K,lz] = = (5) (8.255) 
Z 
Kolz] = -y — Log [5] (8.256) 
where Euler’s gamma constant is given by 

m 1 
y= lim > — — Log[m] | ~ 0.577 216... (8.257) 

m>% = k 


8.3.5 Spherical Bessel Functions 


Next we consider spherical Bessel functions. In spherical coordinates the radial part of the 
wave equation takes the form 


u”[r] + G = ae 2) u[r] = 0 = an = Aj, [kr] + Bn,[kr] (8.258) 
where 

jx] = [Er [x] (8.259) 

[x)= Jin [x] (8.260) 


are spherical Bessel functions of order /, with j, regular and n, irregular at the origin. Nor- 
mally / is a nonnegative integer and the radial variable is nonnegative for physics applica- 
tions, but it can still be helpful to generalize these definitions to include complex arguments 
and arbitrary order. Thus, the spherical Bessel equation then takes the form 


v(v + 1) 


2 
J t= zf El + (1 - ia = 0 = fl] = Aj,[z] + Bn, [z] (8.261) 


with solutions 


rs JA E 

Jz] = J z7? Jy tle] (8.262) 
| T -1/2 

nz] =] 57 N,,1 [2] (8.263) 
2 2 


whose major properties can be derived easily from those for cylindrical Bessel functions. 


z 
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Figure 8.9. Spherical Bessel functions. 


Spherical Bessel functions, like their cylindrical cousins, are entire functions of the 
complex variable v even though most physics applications only require nonnegative inte- 
gers v > / > 0. For integer /, j,[z] is an entire function while n,[z] has a pole of order/ + 1 
at the origin. For arbitrary v, both functions have a branch point at the origin and a cut nor- 
mally taken below the negative real axis, but are analytic everywhere else. Although most 
authors express the proportionality factor between j, and J,,¡,, as y1/2z, that notation is 
ambiguous on the negative real axis. Therefore, we prefer to use 271? instead of V1/z to 
ensure the proper phases on the negative real axis. With our convention the parity of ¿,[x] 
is (-), just as it is for J,[x], whereas the more common V1/z notation results in an addi- 
tional negative sign for x < 0. Figure 8.9 shows that spherical Bessel functions for integer 
I and real arguments are oscillatory with interleaved roots. 

The asymptotic behavior of spherical Bessel functions are trigonometric functions 


zj,lz] = Sin|z - z] (8.264) 
zn,[z] = Cos |z - =] (8.265) 


similar to those for cylindrical Bessel functions. In most physics applications we encounter 
spherical Bessel functions in the form j,[kr] where k is a positive wave number and r a 
positive radius, such that their oscillation amplitudes decrease asymptotically according 
to (kr)~'. However, for more general arguments the imaginary parts of either z or y result 
in exponentially growing contributions that must be handled carefully and may limit the 
usefulness of these asymptotic formulas. 

The basic recursion relations take the form 


2v+1 
fall + frail] = F FL (8.266) 
Yf, ¡]- + Df, [2] = @v + DAL] (8.267) 
AA =27"f k] (8.268) 


dz 

d 

gE fle) = =z” fk] (8.269) 
Z 
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where f € {j, n, h®, h®} is any family of solutions to the spherical Bessel equation or any 
linear combination with coefficients independent of v or z. These can be obtained from 
the corresponding results for cylindrical Bessel functions or from the differential equation. 
Notice that 


si 
jk] = ae (8.270) 
nk] == a (8.271) 


provide solutions to the spherical Bessel equation for v = 0. By inductively applying the 
recursion relations, we then obtain Rayleigh’s formulas for nonnegative integers 


ia = Cad ( 2i mE (8.272) 
nel = ~(-2 (= a ae (8.273) 
h [z] = (o (: a] E (8.274) 
h [z] ==! (2 a] o (8.275) 


that resemble Rodrigues” formula. Explicit formulas can now be developed for positive 
integers, which is the most important situation. 


Using 
Jal = (5) > l 2y (8.276) 
Y 2 LTE +m + 1] 11m +1] 4 
one finds 


i E Vn zy a 1 (-5) 
del = 2) Litpems itm 4 er) 


m=0 


immediately. For integer v > / we use 


1) -Ql-Du 
rfi+5|=va F (8.278) 
to obtain 
00 1 2 m 
To] = = 8.279 
ile ‘> aims 5) eer) 


Similarly, for integer order we use 


N, [2] = Cot ( + 5)” Jp} zl- Cse ( + 3)1| Jy A=, 162] (8.280) 
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such that 
nz] = 0 [Pan T y : Y (8.281) 
i 2 2 GT|m-1+3|m 4 
and 
1 (—2)! 
T|- -|= = 8.282 
E | Vor ( ) 
to obtain 
a1 © = 2m - DU 2)" 
=- 8.283 
nz] = -z 2, a 5 (8.283) 
Thus, the limiting behavior near the origin is described by 
1 
i z (21-1)!! 
Iz] «1 = jk] = Or DI nz] = A a (8.284) 
where 
n!! =n (n- 2)!! (8.285) 
(=D!! = 01! = 11! =1 (8.286) 


is the double factorial function for nonnegative integers. 


8.4 Fourier—Bessel Transform 


The plane wave e, where z = rCos[6] is the distance along the polar axis, can be 
expanded in a Legendre series according to 


Explikr Cos[6]] = >> /;[kr]P,[Cos[9]] (8.287) 
1=0 


where the coefficients f, depend upon kr, treated as a parameter for the purpose of the 
Legendre expansion. Using the orthogonality properties of Legendre polynomials, we find 


1 
fikri = E f Explikrx]P [x] dx (8.288) 
a 


The most direct method to evaluate these coefficients is to expand the exponential 


2+1 SG (ikr)™ f! 
fikri = = we) f x"P [x] dx (8.289) 
2 m! = 


m=0 
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and to use a result 


a m! (m-1-D! 
m+neven => E Ae mada Du (8.290) 


that is proven in the exercises using Rodrigues” formula and partial integration. Thus, 
a leading power (kr) can be factored out and the remaining series contains only even 
powers, (kr) for m = 0, such that 


| co ; 2j- D)! 
m>1+2j = filkr] =(21+ beet ) Ne Qplej+2+ Dn 
= (21 + 1)(ikr) > raro 2 ) 


j=0 
= (21 + 1)é j,[kr] 


is proportional to the power series for j,[kr]. Therefore, we obtain Rayleigh’s expansion 
Exp [ikr Cos[6]] = a + 1) ¡,[kr]P, [Cos[8]] (8.292) 
1=0 


for a plane wave. (This is sometimes named Bauer’s formula instead.) Finally, the Legendre 
addition theorem 


Y [7] - Y, [7] (8.293) 


provides a more symmetrical form 
Explik -7] = any ALTA VALS) YF] (8.294) 
1=0 


which depends upon the angle between the & and ? directions and is independent of the 
orientation of the coordinate axes. This very important formula finds myriad applications, 
including radiation and scattering theory, and should be committed to memory. 

Consider a three-dimensional Fourier transform 


FIX] = f fF] Explik -7] dr (8.295) 


where the integral extends over all space. It is useful to perform a multipole expansion 


00 1 
IEI = Y, D, AAA (8.296) 


1=0 m=-1 


Sink? = [rmx tordo (8.297) 


308 8 Legendre and Bessel Functions 


for the angular dependence of f[7]. Applying the Rayleigh expansion and assuming that 
the order of integration and summation can be interchanged, such that 


fik] = myi f SVT - Y 171 ar (8.298) 
1=0 
and using the orthogonality of spherical harmonics, we obtain 
FIK] = 4r y > iF LAMY, mlk] (8.299) 
1=0 m=- 
where 
Finltl= f Anlrijikr dr (8.300) 


is the Fourier—Bessel transform of the radial dependence of the /, m multipole. 
Alternatively, we could apply the Rayleigh expansion to the inverse Fourier transform 


` sm aau k 
f= | J Exot- i ES (8.301) 
to obtain 
D Pe 
fnll = = dl han De] jkr? dk (8.302) 


where the numerical coefficient is simply (4102/QnY. Substituting the definition for Fnk] 
into 


2 co foe) 
ale f dkk? f dr'r’ f, „lr 1j [kr']j [kr] (8.303) 
0 0 
we require 
2 00 — y 
z f atian e (8.304) 
T Jo rr 


to produce the necessary radial delta function. Although this is an admittedly heuristic 
derivation, the same orthogonality condition can be obtained more rigorously by evaluating 
the limit of the orthogonality integral for Bessel functions over a finite range as that range 
becomes infinite. Thus, the completeness of the Fourier—Bessel transform is expressed in 
spherical coordinates as 


ar-r = Y Dep] (8.305) 
rar)? T © kA? glk jk (8.306) 
rr T Jo 


ól- 7] = >) YI YF (8.307) 
l 
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8.4.1 Example: Fourier—Bessel Expansion of Nuclear Charge Density 


The differential cross-section for scattering of a high-energy electron from the charge den- 
sity of an atomic nucleus takes the schematic form 


do 
30 = + Lal (8.308) 


where o, is the cross-section for a point charge, / is the angular momentum transfer, q is 
the momentum transfer, and F [q] is the form factor. The form factor 


Flal= { plrl lar? ar (8.309) 
0 


is the Fourier—Bessel transform 


rada ura 
A Hl Mi 


k=1 


plr] = [s j (8.310) 
of the transition charge density, which is a reduced matrix element of the charge density 
operator between initial and final states i and f. We neglect convection current, magne- 
tization, and other complications in this introductory discussion. In its simplest form the 
charge density operator sums over all Z protons in the nucleus where the radial delta func- 
tion specifies their positions. Given measurements of the form factor for a set of momen- 
tum transfers in the range dyin $ 4 € Ymax» how does one reconstruct the radial charge 
density p,[r]? If the measurements produced a continuous function over an infinite range, 
one would simply use the orthonormality relation for spherical Bessel functions to invert 
the Fourier transform according to 


J fal 5 
plal== f Flq1j/Larlg” dq (8.311) 


but experiments are limited in range and provide only discrete points with finite precision. 
A more practical method is to expand the radial density in a complete set of basis functions 


plr] = by a,,P; ql] = Filq] = y An [q] (8.312) 
n=1 n=1 
where 
Pinlal = f Pınlrljilarir dr (8.313) 


and to use the method of least-squares to fit the coefficients a, to the data. 
Recognizing that the charge density occupies a finite volume, a common choice of 
radial basis functions is the Fourier—Bessel expansion (FBE) 


Pipl] = lg, r1Olr=Rl, Jld; R] = 9 (8.314) 


with Dirichlet boundary conditions. Here R should be large enough to comfortably enclose 
practically all of the charge but not so large that there are too many q, , < 4 ax to determine 
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Figure 8.10. FBE form factor basis. 


the corresponding expansion coefficients. An important feature of this expansion is that the 
basis functions for the form factor 


2 GF NIRV lq, R] = qjildi„R]j [GR] 


(8.315) 
Y - Gn 


R 
Piolal = | ¡taurlitari? dr = R 
0 


exhibit a strong peak around q,,, and small oscillations elsewhere. Therefore, reasonable 
approximations to the expansion coefficients can be obtained by inspection of the data 
for q = q,,,- Representative basis functions are displayed in Fig. 8.10 for / = 4. The domi- 
nant peak moves out as n increases. As R increases it moves inward and becomes taller and 
narrower, approaching a delta function in the continuum limit R > oo. Clearly experimen- 
tal data cannot determine the amplitude for oscillations with q >> q,,,,; therefore, physics 
arguments are needed to estimate the uncertainty in the reconstructed radial density due 
to the unmeasured form factor for large momentum transfers, q > Q,,,x, Known as incom- 
pleteness error. However, a discussion of that issue would bring us too far from the main 
topic. 


8.5 Summary 


Here we collect some of the most useful results for Legendre and Bessel functions in order 
to provide a convenient reference for later work on boundary-value problems. Additional 
formulas are found in the problems, the chapter, and standard compendia. Unless stated 
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otherwise, we assume (n, /, m} are nonnegative integers, {x, t} are real, and {y, z} are arbi- 
trary (possibly complex). We also assume that (a, b} are constant coefficients independent 
of the degree or order of any functions. Note that some of these results are proven in the 
exercises or are generalizations of results obtained in the text. When working problems, 
do not quote without proof any results beyond those presented in the text itself. 


8.5.1 Legendre Functions 


8.5.1.1 Generating Function 


1 co 
TÉ E P_[x]t" (8.316) 
VI-t +P 2 
1 >= re 
—— Y <P 317 
PP] ra nl a 
1 d K 2 n 
Pla = 2) (2-1) (8.318) 


8.5.1.2 Orthonormality 


1 

i P,[x]P,,[x] dx = ———_,,,, (8.319) 

-1 2n+1 ” 

8.5.1.3 Differential Equation 

d-2£ Td - 2zf'[z] + vv + Dflzl = 0 => fla = aP, [a] + boiz] (8.320) 

8.5.1.4 Recursion Relations 
(V+ Df, —-Qv + Def, + vf,  =0 (8.321) 
Fat 22h +h = Sy (8.322) 
fafa = + Df (8.323) 


Z- DE = vzf,-vf,_) (8.324) 
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8.5.1.5 Special Values 


P] = 1 
P,[-x] = (-)"P, [x] 


(8.325) 
(8.326) 
n (2n)! 
Pon = apa? Pml = 0 (8.327) 
8.5.1.6 Integral Representations 
257 (2 =1)"-. 
= 8.328 
P] Oni (t = gra ( ) 
2 (?-1y 

= d 8.329 

2,121 4iSin[vz] J (z-1)""! ' ( ) 
See Fig. 8.3 for contours. 


8.5.2 Associated Legendre Functions 


8.5.2.1 Differential Equation 


2 
a -AF Td -22f [a + (o Te 2) tel =0=> f, [2] = aP, [z] + bQ, lz] 
l-z 3 


(8.330) 
8.5.2.2 Orthonormality 


: _ 2 (+m)! 
f Ampl a 


(8.331) 
8.5.2.3 Rodrigues? Formula 


d m 
Panel = a 2 (E) Ba 


(8.332) 
mdm)! 
Pim] = (-) qF ml on] 


(8.333) 


8.5 Summary 


8.5.2.4 Special Values 


Fall] E P[+1]6,,0 = EDS 


min (2m)! 
A mlO] =(=) : omn mln! 


| [mn +2k - 1) 
k=1 
Pon+m+1.mL01 =0 


8.5.2.5 Recursion Relations 


2 
P mol] + On + DP [x] + (10 + 1) — mEn + 1))P,,,[x] = 0 


XP, in + g -m+ Da ae P aai E Piim = 0 
Prime B (21 F Da =r Bi ~ Prims = 0 
U—=m+ DP 579 I+ DP, m +O +MP im == 0 
8.5.3 Spherical Harmonics 
8.5.3.1 Definition 
21+1(@-m)!)!? ee 
Yin 16, 6] 7 ( Ar (a + = P, „leos Oje 


8.5.3.2 Symmetries 


Yin [0, ġ] = Era [6, o] 
Y, ml-°] = (-)'Y, 17] 


8.5.3.3 Special Values 


21+1 
4r 


1/2 
Yin 6, 0] = ( P,[cos AO 


Hi 
m,0 


Y nll = Yn {0.01 = (= 
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(8.334) 
(8.335) 


(8.336) 


(8.337) 


(8.338) 
(8.339) 
(8.340) 


(8.341) 


(8.342) 
(8.343) 


(8.344) 


(8.345) 
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8.5.3.4 Orthonormality 


f Pon PY en [°] dQ = 617 Ôm m 


8.5.3.5 Addition Theorem 


An 
Pil? 7] 1= gT È Yl Mil it 21+1 


4n Y 
= = are HAL YP] 


8.5.4 Cylindrical Bessel Functions 
8.5.4.1 Differential Equation 


Zf" + ef ld + (2 -yfe = 0 => Ak 


8.5.4.2 Series 


zy e 1 
ile] = (5) Li T+ m+ itm + 1] (- 


N,[z] = Cot[va]J,[z] — Csc[va]J_,[z] 


8.5.4.3 Hankel Functions 


A(z] = J [z] + iN [z] 
H(z] = J,[z] - iN, [z] 
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(8.346) 


aa ae | (8.347) 


(8.348) 


= aJ,[z] + bN,[z] (8.349) 


(8.350) 


(8.351) 


(8.352) 
(8.353) 
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8.5.4.4 Asymptotic Forms 


Ile] = | Že" Cos |z- (Qv+ D7] (8.354) 
n 4 

N iz] =] 2 172 Sin E -(2v+ D7] (8.355) 
n 4 


HO[z => a Exp lé(z-@v+ DZ) (8.356) 


ie 
ie 


Hz] = E 12 Exp|-i(z -Qv+ DJ) (8.357) 


8.5.4.5 Recursion Relations 


2 
f, ¡Lal + ffiz] = = fle) (8.358) 
fal- fkl = 28,12] (8.359) 
d +y +y 
LE fi = t ll (8.360) 


8.5.4.6 Orthonormality 


kJ [k  R]J/ [kR] — kJ [KRIS [k R] 
k = k2 
1 2 


R 
| Jiki EW [ele dE = R 


O[k —k’] 
VEK 
oe =i] 
VEE 


f ¡REL [Ele dé = (8.361) 
0 


f > J [KEV [ké lk dk = (8.362) 
0 


8.5.5 Spherical Bessel Functions 
8.5.5.1 Differential Equation 


316 8 Legendre and Bessel Functions 


fz] + “Ftd + (1 = — >) Fl] =0= fla = aj,[z] + bn, [2] (8.363) 


Jylzl = zeta (8.364) 
TE 
n,[z] = fe MAN Lal (8.365) 


2 
AM [z] = 5d + in, la = [Bera (8.366) 


ae 
A(z] = j,[z] — in,[z] = Prato (8.367) 


8.5.5.2 Recursion Relations 


2v +1 
Salad + fak] = 7 FT (8.368) 

vf, [2] - (v + DA, E] = @v + DAL] (8.369) 
(ea) =2* f, lz] (8.370) 

-a FA) ==2"f,,, [z] (8.371) 


8.5.5.3 Orthonormality 


R i 7 _ A i 
f jikrlj iori dr = REMERA AREA (8.372) 
0 ki -k 
8.5.5.4 Series 
; Va z y co 1 2 m 
2 - 8.373 
jll = => (5) 2 rm: 3em 4 ió 


8.5.5.5 Asymptotic Forms 
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zj,lz] = Sin|z - 5 | (8.374) 
zn, lz] = Cos |z - =] (8.375) 
+1 
zh [z] = Exp li (< ws > =) (8.376) 
1 
zh® [z] = Exp |- (: = 5 =) (8.377) 
8.5.6 Fourier—Bessel Expansions 
Explik - 7] = amy, i' j [kr\Y,[k] - Y, 17] (8.378) 
1=0 
Expliké Sin[¢]] = DJ, [Kkéle”? (8.379) 
Problems for Chapter 8 
1. Rearrangement formulas for doubly infinite sums 
a) Prove that the rearrangement formulas 
œo œ o Pp œ |r/2] 
> > anm = 5 > Ad p-q = y As p25 (8.380) 
m=0 n=0 p=0 q=0 r=0 s=0 
where 
reven = |r/2] = 5 (8.381) 
r=1 
rodd = |r/2] = Sa (8.382) 


is the floor function, are equivalent provided that the series is absolutely convergent. 


b) Use this result to supply the missing steps in the derivation of the Legendre series. 


2. Leibnitz’s formula 
Prove Leibnitz’s formula 


(2) cta» = X 


(") fom Lx] Lx] (8.383) 
m=0 


for multiple derivatives of a product. 


3. Special values for Legendre functions 
a) Evaluate P, [0]. 
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b) Evaluate P’[1]. 


c) Use the Schläfli integral representation to evaluate P,[1] for arbitrary v. 


4. Legendre integrals 
a) Use a recursion relation to evaluate 


1 
| Pilas (8.384) 
0 
b) Evaluate 
1 
[Pods (8.385) 
0 


for positive integer m. 


5. Legendre series 
a) Evaluate 


> P lx] (8.386) 
n+1 


b) Evaluate 


' PL] 
nl y (8.387) 
E vV1-x i 


c) Produce Legendre expansions for 6[1 + x]. 


6. Parseval relation for Legendre polynomials 
Suppose that f[x] is expanded according to 


00 


fal => a,P, lx] (8.388) 


n=0 


1 : . 
Express f i If [x]? dx in terms of the a, coefficients. 


7. Legendre expansion of powers 
Develop a Legendre expansion for powers, such that 


m 


x” = X AmnP, [x] (8.389) 


n=0 


by expressing the Legendre polynomials in terms of Rodrigues’ formula and using partial 
integration to eliminate derivatives. You may find the beta function 


I[pI lg] 


8.390 
Tip +q] one 


1 
Bip, q] = f PA -AT dt = 
0 


useful, but a proof is not required. 
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8. Legendre function P,[z] for arbitrary v 

Show that Schlafli’s integral representation for the Legendre function of the first-kind sat- 
isfies Legendre’s differential equation for arbitrary v provided that the contour and cuts are 
defined properly. 


9. Laplace’s integral representation for Legendre functions 
a) Suppose that the contour used in Schlafli’s integral representation is a circle around z 
with radius |Vz? — 1|. Show that 


1 T n 
Pla = f (2+ 12 = 1 Costøl) do (8.391) 
0 
b) Use Laplace’s integral representation to evaluate the Legendre generating function 
alt, 2] = $ "P (8.392) 
n=0 


Thus, one obtains a generalization for complex z. 


10. Generating function for associated Legendre functions 
Derive the generating function for associated Legendre functions with m = 0 


00 


Rm)! 1-A i 
gatal = (-)" = = Pre yy LANE (8.393) 
22% m1! (1 = 2rx +1?) +1/2 2, I+m, 


from the generating function for Legendre polynomials. It is not shown often because it 
appears cumbersome, but it can be useful in deriving relationships or performing integrals 
involving associated Legendre functions. 


11. An integral used in the normalization of associated Legendre functions 
There are several methods for obtaining 


l rp+1 _ 221 
=- 2 = = 
fa xX} dx WT 7 Ge) (8.394) 


for 7 = 0 (other than looking it up, of course). A relatively painless method that also 
provides many related integral is based upon the beta function 


_ Tipit) 


ST aa 


(8.395) 


where p, g are generally complex. For our purposes it will be sufficient to assume that p, 
q are nonnegative real numbers so that we can employ the simplest integral representation 


TIp] = { ew du (8.396) 
0 
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Show that the substitution u > x? and a transformation from Cartesian to polar coordinates 
facilitates evaluation of the numerator as an integral over the first quadrant, such that 


m/2 
p,q = 0 = Bip, q] = 4 f Cos[6]??"! Sina]?! dé (8.397) 
0 


It should now be a simple matter to obtain the desired integral using appropriate choices 
of p, g and an obvious change of variable. 


12. Recursion relations for associated Legendre functions 

Many recursion relations for associated Legendre functions can be developed by differ- 
entiating either the differential equation or a recursion relation for Legendre polynomials 
m times. Use this technique to derive the following relations. For simplicity, assume that 
m = 0 for any P,,,,. Observe that a) varies the order, d) the degree, while b) and c) vary 
both. It is probably easiest to derive these relations in the order listed. 


2x 
a) P mol] + (m+ 1 JT = P, ma LX) + (10 + 1) — m(m + 1))P,,,[x] == 0 (8.398) 
b) xP pn + @—m+ DAY PP ny — Pim == 0 (8.399) 
©) Prime ODA PP Pest ms = 0 (8.400) 
d) P= m+ 1)P in — (20+ DAP, p + P+ m)P_; », == 0 (8.401) 


13. m-raising and lowering operators for P,,, 
a) Use the Rodrigues formula to derive the m-raising relation 
x 


= 2\1/2 pr 
= -(1 - x)” P aa 22740 


l,m 


P, 


l,m+1 


(8.402) 


—m 


b) Then use a recursion relation to deduce the corresponding m-lowering formula. 


14. Special values for associated Legendre functions 
Use the generating function to evaluate the following special values or limiting cases for 
associated Legendre functions. Assume that m = 0. 


a) P mll] 

b) P,,,[0] 

c) Ps, mlx] for x = Cos[0]. Notice that this result can be used with the m-lowering operator 
to generate the entire set of P, ,, [x]; this is a common algorithm. 


15. Orthogonality of P,,, with respect to m 
The most useful orthogonality for P, „ concerns differing 7 but common m. Alternatively, 
one can show that 


f P, mlx]P; n [x] dx= 1 (+ m)! 


i 1-2 => m(-m)! m,m”? 


m 


(m, m > 0) (8.403) 


expresses orthogonality between associated Legendre functions differing in order. This 
result is generally less useful because orthogonality between azimuthal eigenfunctions 
usually ensures matching order anyway. 


Problems for Chapter 8 321 


a) Show that the associated Legendre equation is a Sturm—Liouville equation with eigen- 
value m and weight function w[x] = (1-12)7!? and, hence, provides this orthogonality 
integral between eigenfunctions differing in order. 


b) Obtain the normalization for P, „-; and use a recursion relation to step down m. 


16. Recursion relations for Bessel functions 
Suppose that f,[z] = aJ_[z] + bN,[z] where the coefficients a, b are independent of y or z. 
Verify the following recursion relations. 


a) Sula = 24115 fle) (8.404) 
b) + (2 fla) = 42°" fy [2] (8.405) 
c) ei (241) = 2~* f, iz] (8.406) 
d) A (La) = CF a (8.407) 


17. Interleaving of roots for Bessel functions 
Use the recursion relation 


d 
k hel) =e" ak (8.408) 


where f [x] = aJ,,[x] + bN, [x] is a linear combination of cylindrical Bessel functions with 
constant coefficients to prove that there is one root of f,,, between successive roots of f, 
and vice versa. 


18. Rodrigues formula for Bessel functions 
Derive a Rodrigues formula of the form 


z” filz] = D" folz] (8.409) 
H 


n?’ 


where f, is a solution to Bessel’s equation (J,,, N, 


no Nie etc.) and D is a differential operator 
applied n times to fọ. 


19. Orthonormality relations for Bessel functions and eigenfunction expansions 
Piecewise continuous functions on a finite interval that satisfy linear homogeneous bound- 
ary conditions can be expanded in terms of eigenfunctions for Bessel’s equation. Here we 
develop the required orthonormality relations for simple boundary conditions and formu- 
late the corresponding expansions. 


a) Use the differential equations satisfied by J,[k,€] and J,[k,€] for arbitrary k, and k, to 
show 
R 
Rk S[k, RW 1k R] — k 4, [RISA RD = (kj - ef J [kél [RK élé dé 
0 
(8.410) 


for v = 0. Differentiation of this result and application of recursion relations or the 
differential equations can then help with specific boundary conditions. 
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b) Derive the orthonormality relation 


R? 


5 (8.411) 


J k, RYO 


v,n m,n 


R 
f J lk, mS WTA, 616 dé = 


where J,[k, ,R] == O defines the eigenvalues k,,, for Dirichlet boundary conditions. 
Provide an explicit expression for the Bessel expansion of an arbitrary function satis- 
fying Dirichlet boundary conditions. 


c) Derive the orthonormality relation 


y 


R R? 2 5 
T J Ty mo Wy TK 1616 dé = 2 í ~ (5) Jot Omn (8.412) 


where J/[k,,,R] == 0 defines the eigenvalues k,,, for Neumann boundary conditions. 
Provide an explicit expression for the Bessel expansion of an arbitrary function satis- 
fying Neumann boundary conditions assuming that v + 0. 


d) How is the Bessel expansion using v = O for an arbitrary function satisfying Neu- 
mann boundary conditions affected by the null eigenvalue kọ = 0? Show that the 
corresponding eigenfunction is orthogonal to those with k, > 0 and formulate the 
appropriate expansion explicitly. 


20. Continuum orthonormality for Bessel functions 
The continuum orthonormality relation 


ólk — k'] 
VEK 


for Bessel functions with v = 0 on a semi-infinite range can be derived by careful analysis 
of the limit of the result for a finite interval 


E JEJEJE dé = (8.413) 
0 


kJ, [RISA] — k 4,1% R1S7[%, R] 
R-k 
1 2 


R 
T J [ky FV [$18 dé = R (8.414) 
as R > oo. This result should have been obtained in the preceding problem. Use a recursion 
relation to eliminate the derivatives first and then substitute the leading asymptotic behav- 
ior for the remaining Bessel functions. (Why eliminate derivatives first?) After some alge- 
braic manipulation, more easily performed with MATHEMATICA than by hand, one should 
recognize a familiar nascent delta function. 


21. Parseval relation for Bessel functions 
Suppose that f[x] is expanded according to 


00 


fla] => aA [k,l], 4,[K,,R] = 0 (8.415) 


n=1 


: R : : 
in the range 0 < x < R. Express Í, Flal?x dx in terms of the a, coefficients. 
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22. Summation formula for Bessel functions 
Use the generating function to evaluate J,,[x + y] for integer n. 


23. Laplace transform of Bessel functions 
Use Bessel’s integral for J, [x] to evaluate the Laplace transform for Bessel functions of 
integral order. 


24. Bessel’s integral for noninteger y 
Show that 


Jal = z f : Cos|z Sin[6] — v6] d0 — u f i Exp|-(z Sinh[s] + vs)] ds (8.416) 
0 1 


for a suitable domain of z. 


25. Plane wave in cylindrical Bessel functions 
Show that a plane wave can be expanded in terms of cylindrical Bessel functions according 
to 


Exp|iké Sin[6]| = y J [k€]e*r? (8.417) 


m=-—00 


26. Leading asymptotic behavior of Hankel functions 

Apply the method of steepest descent to deduce the leading asymptotic behavior of Hankel 
functions from their integral representations. Then deduce the corresponding behavior of 
Bessel and Neumann functions. For simplicity assume that |z| >> lv] (why?). 


27. Power series for Neumann functions 

Complete the derivation of the power series for Neumann functions with integer n = 0 
using the fact that gamma functions have simple poles with known residues at negative 
integers. 


28. Wronskians for Bessel functions 
Recall that the Wronskian 


1 
WIF, el = fzlg [zl — gizl fiz] « == (8.418) 
plz] 

for two independent solutions f, gto a Sturm—Liouville equation 

(£ + Awlz))f lz] = (8.419) 

(£ + AwlzDglz] == 0 (8.420) 
generalized to complex z where 

£= d ( £ 8.421 

= ple) - alz (8.421) 

takes the form 

WES, g1 — (8.422) 

plzl 


Confirm the validity of this result for Bessel functions and identify plz]. Then evaluate 
the following Wronskian relations between various Bessel functions. Also, express these 
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Wronskians for parts a) and b) as combinations of products of Bessel function of different 
orders, without explicit derivatives. 


a) WIJ,,J_,]. Explain the result for y > n, where n is an integer. 
b) WIN] 

co) WH, HO]. 

29. Recursion relations for spherical Bessel functions 


Suppose that f [z] = aj,[z] + bn,[z] where the coefficients a and b are independent of 
vorz. 


a) Starting with the corresponding relations for cylindrical Bessel functions, verify the 
basic recursion relations: 


2v+1 
Fa lzl + fR] = = fiz (8.423) 
Vf, [2] - + Df, [2] = Qv+ DAE] (8.424) 
falz] = Sa - fiz] (8.425) 

1 
f; lz] = SE + fiz] (8.426) 


Then demonstrate the following variations: 


d 

b) qe fled =2" FE (8.427) 
d 

c) gE fled) == fak] (8.428) 
va 
1 dyt 

d) Ez) ple) = ak (8.429) 
1da\ 

e) Ga (FLD = (OO fa la (8.430) 


30. Poisson integral representation for spherical Bessel functions 
Use 


-—] 1 
j [kr] = > f Explikrx]B[x] dx (8.431) 
=j 


derived in the text to obtain the Poisson integral representation 


7 m 
jle] = P f Cos [p Cos[9]] Sin[9]?** da (8.432) 


yyy ô 


for real p and nonnegative integer /. 
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31. 


a) 


b) 


c) 


Orthonormality relations for spherical Bessel functions 
Use the differential equations satisfied by j,[k,r] and j,[k,r] for arbitrary k, and k, to 
show 


R 
R? (ka ilk R] J; [k R] — ky jilk,RIj [k RD) = (ki o f hlkirljlkrir dr 
0 


(8.433) 
for/ = 0. 
Derive the orthonormality relation 
. di A . 2 R? ` 2 
Ji [kR] === A Ji [knr] Jı [knr] rdr = z Im [KR] Oven (8.434) 


for Dirichlet boundary conditions. Provide an explicit expression for the spherical 
Bessel expansion of an arbitrary function satisfying Dirichlet boundary conditions. 


Derive the continuum orthonormality relation 


O[k — k'] 
kk’ 


2 '00 
= f jikrlj[k' rir’ dr = (8.435) 
0 


9 Boundary- Value Problems 


Abstract. Several methods are developed for constructing solutions to partial dif- 
ferential equations that match boundary conditions on a surface. We focus on equa- 
tions based upon the Laplacian operator, such as the Helmholtz equation, that are 
expressed in separable orthogonal coordinate systems. Eigenfunction methods can 
then be used to develop series expansions or integral representations. Green func- 
tions provide general solutions for inhomogeneous problems. Examples include elec- 
trostatics, magnetostatics, and scattering theory. Exercises at the end of the chapter 
explore a wider variety of problems. 


9.1 Introduction 


We are often faced with the problem of solving a partial differential equation within a vol- 
ume V bounded by a surface S subject to boundary conditions that specify the solution 
and/or some of its derivatives on S. Since neither time nor space nor patience nor expertise 
permits an exhaustive survey of this extremely broad subject, we shall be content to study 
equations based upon the Laplacian operator expressed in separable orthogonal coordi- 
nate systems. We take as a prototype the inhomogeneous Helmholtz equation 


(V? + ws, t] == —Aaplr, t] (9.1) 


which includes among its special cases Laplace’s equation (k = 0, p = 0), Poisson’s equa- 
tion (k = 0), the wave equation (k = w/c), the diffusion equation (k? > —«), and the 
Schrödinger equation (k? = 2mE, —4rp = 2mVw). In the happy circumstance that the 
boundary consists of surfaces on which one of the coordinates is constant, such problems 
are often amenable to the method of separation of variables. 

Rather than plunging directly into the general theory, we illustrate the basic strategy 
by solving Poisson’s equation 


VWF] = —4zp17] (9.2) 


within a rectangular box {0 < x < a,0 < y < b,0 < z < c} with specified potentials 
upon its surfaces. We can divide the problem into several parts. First, we determine the 
potentials w,[7, t] that satisfy Laplace’s equation 


Vy] =0, FES => y= PES > y= (9.3) 


i? 


where y, is specified by the two-dimensional function V, on surface S, and vanishes on the 
other five faces of the box. Then we determine the Green function that satisfies 


V2GI[F, 7] = —4n6[7 - 7] (9.4) 
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with boundary conditions 
YIO, y, z] == yla, y, z] == Whe 0, z] == Wh, b, z] == yx y, 0] == Y lx, y, c] = (9.5) 


on the surface of a grounded box. Finally, recognizing that Poisson’s equation is linear, we 
can superimpose these solutions to obtain the general solution 


6 
W= | Gti. lav’ + Xu 0.5) 
v i=l 


for an arbitrary internal charge distribution and arbitrary potentials on each of the six sides 
of the box. 


9.1.1 Laplace’s Equation in Box with Specified Potential on one Side 


Suppose that we wish to solve Laplace’s equation V?y == 0 in a rectangular volume {0 < 
xsa,0<sysb,0<z<c}withy = 0 on all faces except 


z =c => VLx y, c] = VLx, y] (9.7) 


This problem is ideally suited for the method of separation of variables, wherein we pro- 
pose a factorized solution of the form 


yix, y, z] = X[x]Y [y]Z[z] (9.8) 
for which 


lex 1Y 122 _ 


Vy =0 = + + = 
Y Xd Yd Zd 


=0 (9.9) 


Recognizing that each of the three terms is a function of a different variable, each must 
separately be constant, such that 


1 PX y IY _ p 142. 
Y d 


= Zd? 


xa? == = Y (9.10) 


, 


where the separation constants {—a”, —f?, y?) must satisfy 
e+e=y (9.11) 


Note that the separation constants were chosen with the foresight of experience (disingen- 
uous hindsight), but lacking such foresight we would have used simply {A, B, C} and then 
discovered later their most natural interpretations. The boundary conditions with respect 
to x, y are satisfied using 


X[0] = X[a] = 0 = X,[x] =Sin[o,x], a,=— (9.12) 


n 


Y[0] = Y[b] = 0 => Y, ly] = Sin[B,.y1, Bm = — (9.13) 
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with integer m, n, while the lower boundary condition on z requires 


Z[0] == 0 = Z[z] = Sinh al Vom = V+ (9.14) 


A linear superposition of solutions of this form then gives 


yix y, 2] = Y Ay Sinlo,] Sin[f,,y] SinhLy, 21 (9.15) 


nm 


where the upper boundary condition on z requires 


4 a b 
f ax f dy Sin[a,x] Sin[6,, y]V [x y] 
c] Jo 0 


yix, y c] [x y] na ab Sinh[y,,,, 
(9.16) 


Solutions to more general problems in which nonzero potentials are specified on more than 
one face can then be constructed simply by adding several solutions of this type. 


9.1.2 Green Function for Grounded Box 


We can reduce the three-dimensional equation 

V?GIP, 7] == —4n6[7 - 7] (9.17) 
with boundary conditions 

yO, y, z] == yla, y, z] == Whe 0, z] = Whe b, z] == yx, y, 0] = Y Lx, y, c] = 0 (9.18) 


to a one-dimensional equation by using the expansions 


PONEY „2 An i nr. . , 
ó[r -F'] =6ólz2-2'] E y Sin[a,x] Sin[a,x ] f y Sin[£ ¡y] Sin[£ ¡y ] (9.19) 
i=l j=l 
G[r, 7] = 3 8; j z'] Sin[a;x] Sin[a;x’] Sin[f ¡y] Sinf y] (9.20) 
ij=1 
to obtain 
or , lór y , : 
(x = oa) eke Z]= ap OF =z],  g,,10,7']= g lez] (9.21) 
where 
Y=% +B; (9.22) 


is the separation constant formed from two eigenvalues for the degrees of freedom that 
we chose to eliminate. The sine expansions satisfy the boundary conditions on four of the 
six sides automatically, but we must use the resulting one-dimensional inhomogeneous 
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Helmholtz equation to reconstruct the delta-function source. We chose to retain the z vari- 
able because the nontrivial boundary condition occurs on a side with constant z. 
Using solutions to the homogeneous Helmholtz equation 


Os<zs7 = g; lz z] =A, Sinhly, ;z] (9.23) 
Z <z <c = g; [zz] = B, ¡Sinhl[y, ¡(c — 2)] (9.24) 
that satisfy the boundary conditions one either side of the source and applying the matching 
conditions 

Ag; jz’, z] = 0 = B; ¡Sinh[y, ¡(c — 2’)] — A; ¡Sinh[y, ;2’] = (9.25) 

pee 167 , F 167 
Ag; lz z] = “SE = B, j Coshly, ¡(c — 23] + A; ; Coshly, ;z ] = aby (9.26) 

ij 


across the z = z’ interface, we find 


16x Simh[y, (c -z’)] pal Sinh[ y, ;2’] 


2 B = i 9.27 
t! a ,;  Sinhly, jc] td aby, ; Sinh[y, ;c] ( ) 
Thus, we find 
167 Sinh[y, z<] Sinh[y, ¡(c — z,)] 
ls ee a (9.28) 


aby, j Sinh[y, jel 


where z_ is the smaller and z, is the larger of z and z’. This form should be very familiar, 
by now, from our work with Sturm—Liouville systems. Therefore, we can assemble the 
entire result 


o 16 Sg. 
Gli, 7] = — J” Sinfe.x] Sinfe,’] Sin[B y] Sin[B y] 

ijel 
Sinhly, z<] Sinh[y, (e = 2,1 


Y, ; Sinhly, jc] 


(9.29) 


in the form of a piecewise continuous, two-dimensional Fourier sine expansion. The expres- 
sions may be lengthy, but the analysis is hardly more complicated than that for one- 
dimensional Sturm—Liouville systems. Two other similar representations can be obtained 
by simply permuting variables and their associated boundary conditions; the most conve- 
nient choice may depend upon the representation of the source density, p[7], in the Poisson 
equation. Notice that the reciprocity condition 


G[r’, 7] = G[r, 7] (9.30) 


is satisfied automatically. 
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Alternatively, we can employ Fourier sine expansions in all three dimensions 


ól- ř] = 3 Sin[a;x] Sin[a,x’] Sin[8 y] Sin[8 y] Sin[y,z] Sin[y,z’] (9.31) 


ij k=1 


GP, 7] = > g; e Sin[a,x] Sin[o¿x'] Sin[B,y] Sin[8 ;y'] Sin[y,z] Sin[yz’] (9.32) 
ij k=1 


where now we define y, = k1/c with a single index. Substituting into Poisson’s equation 
and using orthogonality to equate coefficients 
Pe E Sinja] Sin[8 y] Sinz 9.33 
(a; + B; T Vie )8i jk = -abe in[a¡x”] in[P y] in[y,Z ] (9.33) 
we obtain 


2m Y, Sin[a,x] Sin[a,x”] Sin[£ ¡y] Sin[£ y] Sin[y,z] Siniy,’ 
Gl?.7] = 327 [ax] Sin[@;x’] Sin[8 y] Sin[8 ¡y”] Sin[y,z] Sin[y,z’] (9.34) 


2 2 2 
abc ial Q; +B; +y 


Equivalence between Eq. (9.29) and Eq. (9.34) may be verified by performing a Fourier 
sine expansion of g, ¡[z, z’], whereby 


2° pa 
Si j,k = f 8; Z ] Sin[y,z] dz 
0 


C 

327 Sinh[ y, ¡(ce — z’)] Zz 
= Sinh[y, ,z] Sin[y,z] d 

ee Sinh[y, ;c] J, AA (9.35) 
Sinh[y, ;z] f "Sinhly, C- 9] Sininz] dz) 

Sinh[y, ¡c] la Vij z Wz] AZ 


Although it is not difficult to evaluate these integrals by hand and simplify the result using 
standard trigonometric identities, we prefer to let MATHEMATICA perform the ‘grunt’ work 
to obtain 
327 | Sinh [y, , (c-2")] f° sinh [y, ,2] Sin [yz] az 
+ 
aber, sinh [y, ,c] 


Simplify | 


Sinh [x,, 2] J: Sinh [x, ¿Ce - z)| Sin [xz] a| 
Sinh [y, ;c] | 


i 


h, > a +B}, Sin [y,c] > o} ) / / Simpli fy 


32x Sin [y,z’ | 
abc(a +g) +tabcy 


in agreement with our analysis based upon Poisson’s equation. 
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A point charge g located at position 7’ within a grounded box produces an electrostatic 
potential 


pir] = golr - 7] = y[F] = Gir, 7] (9.36) 


while a charge density distributed within the box produces 
am = f oriav (9.37) 
v 


where V denotes the enclosed volume. The integration can be performed using either form 
of the Green function, whichever seems simplest for the actual interior charge distribution. 
Of course, in real life, such integrations usually must be performed numerically outside 
the classroom. 


9.2 Green’s Theorem for Electrostatics 


In electrostatics or magnetostatics we are often given or seek to define either the potential 
or the field on some closed surface and then need to determine those quantities everywhere 
within the volume enclosed by the bounding surface. Although one could, in principle, 
compute the potentials by adding the contributions of all charges and currents, we often do 
not know the detailed distributions of charge or current outside the volume of interest. For 
example, if we use a battery to maintain a constant potential on an electrode, we probably 
cannot guess the distribution of charge on its surface except in the simplest of geometries; 
hence, we are faced with a boundary-value problem. Once we know the Green function for 
a point charge within the volume of interest subject to the specified boundary conditions, 
we could compute the distribution of surface charge induced upon the electrode by the 
interior charge density. Dirichlet boundary conditions specify the value of a scalar poten- 
tial y on the surface S while Neumann boundary conditions specify its normal derivative. 
Sometimes one encounters mixed boundary conditions for which the value is specified on 
some portions and the normal derivative on others. Cauchy boundary conditions specify- 
ing both the potential and its normal derivative are too restrictive for Poisson’s equation, 
generally precluding existence of a solution, but can be useful for other subjects. 

In this section we use Green’s theorem to construct formal solutions to either Dirichlet 
or Neumann boundary value problems for Poisson’s equation. First we review the deriva- 
tion of Green’s identities based upon the Gauss divergence theorem. Let A represent a vec- 
tor field within a volume V bounded by a surface S. The divergence theorem then states 


[ dav = faa (9.38) 


where dS = ñ dS is the directed element of surface area with A being the outward normal. 
If we choose A = WV¢ where ¢ and y are differentiable scalar fields, substitution of 


A =YV = V-A =YV% + Vy- Vo (9.39) 
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into Gauss’ theorem immediately yields Green’s first identity 


f (UV? + Vy - Vo) dV = $ WVo-dS = $ ye dS (9.40) 
where 

as Vo (9.41) 

on 


is aconvenient shorthand notation for the normal derivative, the component of the gradient 
in the direction of the outward normal to a surface. Interchanging ¢ and y and subtracting 
the new form of the first identity then provides the more symmetric Green’s second identity 


ð 
[uve-ovnav = puds- a3= pu -owas (9.42) 
Poisson’s equation for the electrostatic potential y takes the form 
Vil] = —4zp [7 (9.43) 


where p is the charge density found within volume V. The contributions of charges that 
might be found outside the volume of interest are represented by the boundary conditions, 
either the potential or its normal derivative on the bounding surface. It is useful to define 
the Green function G[7, 7’] to be the potential at 7 produced by a unit point charge at 7’ as 
the solution to 


VGI, 7] == —4x6[7 - 7] (9.44) 


subject to appropriate boundary conditions to be specified later. Next we apply Green’s 
second identity by choosing y to be the electrostatic potential and ¢ to be the Green func- 
tion, such that 


f (WIY]V?GT,F] - GP, FIV yP Dav’ 


- fur P] 


Using Poisson’s equation and the definition of the Green function this becomes 


2 


owt] 
G[r, + An! 


(9.45) 


sh f orir- ridi pl") av’ 


= por TA — G[r, AO, as’ (9.46) 
ón ón 


Thus, we obtain a formal solution to Poisson’s equation in the form of Green 's electrostatic 
theorem 


1 
TE a GIPP IpI” ]av" + > $ (GIF 77 
T 


Tf there were no bounding surfaces, the first term would represent the familiar electrostatic 
potential produced by a specified charge distribution. The second term then represents 


ocr 7] 


ae yr ]—_—) 4s’ (9.47) 
n 
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the contribution made by external charges that determine the conditions on the bounding 
surface. 

For Dirichlet boundary conditions specifying w on S, it is convenient to require that 
the Dirichlet Green function, Gp, vanish on S such that 


reS => Gpl, r] = 0 = 
0Gp T, Y 
els fo ol,” ol?’ av! = af ant as (0.48) 


The corresponding result for Neumann boundary conditions, specifying 0W/0n’ on S’, is 
slightly more complicated because we cannot simply require OG,/0n” to vanish on S’. 
The problem is that the average value of the normal derivative on the bounding surface 1s 
constrained by Poisson’s equation and Gauss’ theorem such that 


fè V'GIr, Idv = IA dV’ =$? G[r,7"]- ds’ (9.49) 
requires 
$ Boln r] gy = cin (9.50) 
ôn 


Hence, the simplest boundary condition we can impose upon Gy is 


OGyIF, r] 4r 
ERES 51 
on’ S an 


where here S is the area of the bounding surface. Thus, a formal solution to the Neumann 
boundary-value problem can be expressed as 


IGN P] _ 4n 


On’ S 


R (9.52) 
1 owl?’ 
e fo PPP + $ Gy rE 


ds’ 
On’ 


where (y), denotes the average potential on S. Often Neumann problems appear in exterior 
form where the region of interest is outside an inner boundary and can be interpreted as 
within an outer boundary that is taken to oo, such that 


reir ees (9.53) 


w= [arrimar ¿arta 


where only the inner surface provides a finite contribution. 


S > œ => (Y4)5 > 0, 


(9.54) 
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9.3 Separable Coordinate Systems 


The Laplacian operator can be expressed in a curvilinear coordinate system with 7 = 


(Ep $2 $3) as 


3 
1 ô (hhh, ow 
Vw = L 9.55 
f hhh; 2, 0, hy 0É, is 
where 
3 2 
Ox. 
k = mee 9.56 
Al) ram 
are diagonal elements of the metric tensor 
3 
Ox. Ox; 
8,;= —— (9.57) 
a Ly dE, OE, 


relating the cartesian coordinates (x,, X3, X3) = (x, y, z) to the curvilinear coordinates (€,, €,, 
&,). We consider a coordinate system 7 = (&,, €,, é) separable when a product function of 
the form 


WE), é» éz] = X [8,1X,18,14[€3] (9.58) 


permits the Laplacian to be expressed in the form 


Vw 7 1 od OX ll Hlé] ô 0X41] 
p= lel saya (eae) ea El ae) 
Fol5214 3183) 0 OX3[€3] 
9.59 
X [é] E, («tes DE, ) ( ) 
One can then separate the homogeneous Helmholtz equation 
Vy + ky = 0 (9.60) 
in a sequence of steps 
halé] d dX: [é] 
=À 9.61 
XI] dé, («tes dé, 3 ( ) 
hlé] d dX,18>] = 
rail) rate = (9.62) 
1 d dX [é] ae 
ús E [£,1 dé, (isa dé, Ji ais 


that produces three second-order ordinary differential equations coupled through the two 
separation constants A, and A). 

It turns out that there are 11 orthogonal coordinate systems that are separable in this 
manner. A comprehensive analysis can be found in the treatise by Morse and Feshbach. 
Here we will be content to illustrate the general method using just the most common of 
these systems: rectangular, spherical polar, and cylindrical. We have already discussed the 
rectangular case and now proceed to spherical and cylindrical coordinates. 
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9.3.1 Spherical Polar Coordinates 


Using the familiar spherical polar coordinates 


x = rSin[6] Cos[$] (9.64) 
y = r Sin[0] Sin[¢] (9.65) 
z = rCos[6] (9.66) 


the Laplacian takes the form 


1 0 ( ,0w 1 O la. n OY 1 Py 
Vy = (> )+ (s 0 J+ 9.67 
Y= ar" ar} * aa)? Asimek og? on 
We propose a separable solution of the form 
YF] = R[r]O[9]D[6] (9.68) 
for which the Helmholtz equation becomes 
10 {ðR 1 500 (5 2) 1 10% 2 
Sin[0 Kr =0 9.69 
Ror ( ar)* sn © 06 Aa)" nro **" (269) 


Recognizing that only one term depends upon ¢, we define a separation constant m using 


50 == -m > Q = etn (9.70) 
leaving 
1 ð (¿0R 1 10(, ,,00 m? ea 
al eee r S 71 
Ror ( a) + Sin[6] © 00 ( [6] aa Sin[9]? TK (9.71) 


Isolating the 9 dependence using a foresightful separation constant 
A A age 
Sin[9 75 (snr =) =m%0 == —1(1 + 1) Sin[0] © (9.72) 
and employing the transformation 


x = Cos[6] => sino) = -(1-2’) 


dO 
ams 9.73 
dx (9.73) 
we recognize the associated Legendre equation 


2 


(1-2) a Q +1)- Jo = 0 => O[x] = P,,, lx) Qp l (9.74) 


1-x 


whose solutions are combinations of regular and irregular associated Legendre polynomi- 
als. Finally, using 


ulr] 
FE 


Riv] = (9.75) 
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the radial equation reduces to the spherical form of Bessel’s equation 


(+1 
”4 (e ee 7 ) u=0= au = j,[kr], n,[kr] (9.76) 
r r 
whose solutions for k > 0 are composed of regular and irregular spherical Bessel functions. 
Therefore, general solutions can be constructed from appropriate linear combinations of 
the schematic form 
Alkr]) ( P, m| Cos[9]] \ ( es"? 9.77 

l=), Y, m de [kr] Orn Cos[6] e mó ( . ) 
where y,,,, is a constant coefficient that multiplies terms obtained from the appropriate 
choices from each of the three columns. Often the boundary conditions will eliminate 
one of the choices from each column, but otherwise we need to superimpose all possible 


combinations for the most general conditions. Similarly, depending upon the nature of k it 
might be more appropriate to employ modified Bessel functions or 


k>0 = jkr] or, nik] > ro! (9.78) 


Remember, this method provides a useful strategy, but it still must be adapted to the par- 
ticular features of the problem at hand. 

Often it is much more convenient to combine the angular functions into spherical har- 
monics and to use the identity 


2ulr ü (+1 
ae Y, mlO, ġ]= fu i= uri} Y, mlO, 0] (9.79) 
The Helmholtz equation then reduces to the radial equation 
> +1 
w’[r] +(e- = Ju [r] =0 (9.80) 


such that y[7] can be represented as a multipole expansion of the form 


a k 
WP] D Van y Yinll 8 (9.81) 


Lm 


where the w,,, are expansion coefficients and u [kr] is a conveniently normalized solution 
to the radial equation and its boundary conditions. Note that we have assumed, for sim- 
plicity, that the irregular Legendre functions can be discarded, but generalization should 
not be difficult. 

It is also useful to commit to memory some of the standard Green functions for open 
boundary conditions. Namely, for Poisson’s equation we find 


E. an Ye “< Y IM YIP] 0.82) 


cy POB rd a y] 
VGP. P] = —4n60F, 7] = GP ?1= == ai" 
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where we remind the reader of the convenient notation 


1 1 
Y17-X19]= Y, 10, OM nl, 9 = >) Y, 18, 61%, 10, 91] 
m=-1 m=-1 
(9.83) 


1 
= $ "Y, nl, 61%, 10 9] 


m=-] 


Similarly, for the Helmholtz equation one finds 


(V? + KGI, 7] == —4ró[7, 7] 


ae o. Exp[+ikl? - 71] . . (4) A a (9.84) 
= G7] = A a = +4nik Y jkr dar [kr 1,15] - Y, 17] 
7 
where hy = j, + in, are spherical Hankel functions for outgoing (+) or incoming (—) 


boundary conditions. The radial factors are obtained by interface matching with the aid of 
the Wronskian for the radial differential equation; we leave the algebra as an exercise for 
the reader. 


9.3.2 Cylindrical Coordinates 


Using the cylindrical coordinates 


x = €Cos[¢] (9.85) 
y = ¿Sin[g] (9.86) 
Z=Z (9.87) 


the Laplacian takes the form 

10 =) 18y y 
+ + 

¿0 k 0) EaP Az? 


We propose a separable solution of the form 


Vy = (9.88) 


y [+] = RIESIA] (9.89) 


for which the Helmholtz equation becomes 


+k? == 0 (9.90) 


110 OR 1106 10z 
l ) 2a y 
REOEV OE) OL ap Zax 


Separating first the fp dependence using trigonometric functions with periodic boundary 
conditions and then the z dependence using hyperbolic functions assuming finite range, 
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we obtain 
T == -mM => [z] € {e”"*, ein) (9.91) 
toe 3 ; 
Zag a, = Ziz] e {Sinh[a,,z], Cosh[a,,z]} (9.92) 
1 ð (_OR 2 nm) 
é OE Ga + (e +a — wr). =0= RIE] E Wi 18151, Nr 1851) (9.93) 
with E =K +a (9.94) 


where we have assumed, somewhat arbitrarily, that the separation constants {m, a, 6} are 
real; if not, we replace a trigonometric function by an exponential function or a Bessel 
function by a modified Bessel function. The schematic notation f € [f,, f2} indicates a 
suitable choice or linear combination of solutions that satisfies the appropriate boundary 
conditions. Therefore, complete solutions can be constructed from linear superpositions of 
the schematic form 
: imp 
a coa a) (ces) 095 
mn m n n 

Depending upon the boundary conditions, it might be more convenient to employ 
trigonometric functions for Z and then modified Bessel functions for R. Alternatively, 
sometimes it is more convenient to separate the R dependence first and then match across 
an interface with respect to Z. If the range of the angular variable is restricted, different 
boundary conditions and different angular functions could be needed. It takes experience 
to anticipate the optimum choices for a particular problem, and some of that experience 
can be acquired by solving the problems at the end of the chapter! In the remainder of this 
chapter we will concentrate upon spherical geometries, but many problems with cylindri- 
cal geometries are provided. Rather than attempt to apply the results derived here directly, 
it is usually better to perform the separation of variables for each problem anew in order 
to make informed decisions regarding the order of separation and the nature of separation 
constants. 


9.4 Spherical Expansion of Dirichlet Green Function for Poisson’s 
Equation 


We seek to construct a Green function for Poisson’s equation that vanishes on the concen- 
tric spheres r = a and r = b and satisfies 


V?G[P, 7] = -4n6[7 -7'], G=00nr=ab (9.96) 
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in the enclosed volume. It is natural to employ spherical polar coordinates and to make a 
multipole expansion of the form 


G[r, 7] = y gilr, r TX, 17 - Y, 17] (9.97) 
1 


Ó y 
dena SYA YF (9.98) 
1 


ó[?-7P]= 


E 
where g, satisfies the radial equation 


1Plrg) 1+1) 
r dr r? 


4 
gleri = lr =i] (9.99) 


Solutions to the homogeneous equation that satisfy either the inner or the outer boundary 
condition take the form 


r<r = g= a((2) - (17) (9.100) 


a a 


po a((Z = (7) (9.101) 


and are subject to matching conditions 
ryt y\—/-1 äg ny -l-1 
r r r r 
== B = == == A -= mi a al 2 
E (5) (5) ) (=) 5 ) iia 
4 B aN y\—l-1 A nt n11 
Ag =- 4, = 5(0(5) +0+1(5) )-S((5) +a+D(7) ) 
(r’) r b b r a a 


(9.103) 

across the interface. It is convenient to rewrite these equations as 
BA - p) = A(1 - a) (9.104) 
B( + BU +1) - AU + a(l + 1)) = 4r) (9.105) 

where 
21+1 pyr 
a=(“)" g= (>) (9.106) 
r r 


It is now easy, albeit tedious, to solve these equations 


sol = Solve [{B(1 - B) ==A(1-a),B(1+B(1+1)) -A(+a(1+1)) == -4n (r')™} , 
LA, BH] [[1]] / / Simpli fy 


{ y MEAN An(-1 +a) +) 
(1+21)(-a +8) ” (1+21)(a-ß) 
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and to construct the two pieces 
r\! r yl r\! r\-/-1 
{4((Z] ae }.»((F) sial Jf 7 sors. 
{x> (37 R> (35) / / Simpli fy 
pl lA E tte Gy er 
| arn (H 
anA + Gy) dae) eo 
- (1+27) (5% -(23%) | 


r 


Therefore, combining the two pieces, we obtain the Dirichlet Green function for the vol- 
ume between concentric grounded spheres as 


Ar 1 (b+! _ nian Loi E aut) 


9.107 
21+1 e perl = quit ( ) 


glr, r'] = 


where r_ is the smaller and r, is the larger of r and r’. Notice that this function is simply 
the product of the inner and outer solutions with a normalization based upon their Wron- 
skian; that general form should be familiar from our work on Sturm—Liouville systems and 
appears often in various guises. 

Several important special cases 


, An r r_\2+1 
1202412 (1-(2) (9.108) 
4n r ey 
b > œ => glr, r] = Iri ['-(2) | (9.109) 
, 4n rl 
NEE aan A arg el (9.110) 


can be obtained from the limiting behaviors when one or both of the radii assume extreme 
values. The first case listed above describes the interior of a sphere of radius b, the second 
describes the exterior of a sphere of radius a, while the third describes an open geometry 
without bounding surfaces. The result for open geometry should already be familiar from 
the multipole expansion 


1 4r r . F 
FFI = ya ell YF (9.111) 
1 > 


for the potential of a unit point charge in empty space. 

Given an arbitrary charge distribution p[7’] in the volume between concentric spheres 
with specified potentials y, = yla, 0, $] and Y, = W[b, 0, ġ], we can now compute the 
electrostatic potential anywhere within the enclosed volume by integrating 


WF] = | GÈ, 7’ |p?’ dv’ - = $ Wr’ JV’ GP, 7] - ds’ (9.112) 
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taking care to interpret the directed element of surface area as outward with respect to the 
enclosed volume such that 


r =a = dS’ = -e dO” (9.113) 
r =b = dS = +b dX” (9.114) 


where 7” is the radial unit vector. Thus, the second contribution to the potential produced 
by a point charge in the presence of a grounded sphere must represent the potential due to 
charges on the surface of the conductor. Several examples of the use of this Green func- 
tion are given below. Some of these examples are simple enough to evaluate symbolically, 
but generally numerical integration would be required for nontrivial distributions. Never- 
theless, a mathematician would consider the problem solved because numerical integration 
can be performed by computer in a straightforward manner. A physicist, on the other hand, 
would probably require a more practical demonstration that the numerical method works! 


9.4.1 Example: Multipole Expansion for Localized Charge Distribution 


Suppose that we seek the electrostatic potential 


n= f Gtr pt av’ (9.115) 
for distances r >> R where R represents the radius containing all of the charge, such that 
GF] = $ alr WAY] (9.116) 
1 
4n r da r 
i= 9.117 
ari Wael it 
The potential then takes the form 
7 = Y ny pp 9.118 
y[r] = ml ml? | (9.118) 
l,m 
where the multipole amplitudes are 
i k YY? 10, blpl#] dV (9.119) 
= A F . 
Um 21 # 1 l,m P 


The lowest nonvanishing multipoles tend to dominate for r > R. 


9.4.2 Example: Point Charge Near Grounded Conducting Sphere 


The potential due to a point charge near a grounded conducting sphere has two contribu- 
tions, one from the point charge itself and another from the distribution of surface charge 
needed to maintain the conductor at constant potential. One can show that the contribu- 
tion of the induced surface charge density is equivalent to the potential produced by an 
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image charge. Suppose first that the point charge is inside the conducting sphere, such that 
r,r' < b. By writing 


e (7 _ 1 (refs) _ (=) _ Tb r (9.120) 
PONE b\ p bip? per 
we obtain 
de AAA | 
a YIP]. YIP] = 2 — 121 
ES HHT = Ra a 


where °, = 7(b/r'Y is the location of an image charge outside a sphere of radius b induced 
by a point charge inside the sphere at 7’. Similarly, by writing 


r. 7 21+1 E 1 a 1+1 E 1 e 7 r r, 6 b 
ae E a ETs ~a\rr} ar 
we obtain 
4n rfa a r 1 
——— s< | __ YPY P = == 9.123 
7 zralé) A [+] a-r, ( ) 


where 7, = 7’(a/r’)? is the location of an image charge inside a sphere of radius a induced 
by a point charge outside the sphere at 7’. Therefore, the potential for a point charge at 7’ 
in the presence of a ground conducting sphere of radius R becomes 
2 
y= +H with g-i, À= E (9.124) 
r—-r| lr-7) r r 

where q, is the magnitude and 7, is the location of the image charge. The same expression 
is obtained whether the charge is inside or outside the sphere but, of course, the potential 
vanishes in the region on the opposite side of the conducting surface from the point charge. 

The actual surface charge density is determined by the normal derivative of the elec- 
trostatic potential according to 


1 4 
Sif EA 9.125 
d 4r ( Or r=R ( ) 
which, after simple but tedious algebra, reduces to 
$2. R2 
c- 4 Ir | (9.126) 


ATR (R? — 27RCos[6] + 2)” 


where Cos[6] = 7-7” is the polar angle on the sphere relative to the line between its center 
and the point charge. A simple calculation shows that the total induced charge 
1 -g, "<R 
2nR? f o dCos[8] =} 4 r (9.127) 
-1 —qR/r', r>R 


is not necessarily equal to the image charge. If the point charge is inside the grounded 
sphere the induced charge is equal to —q so that the net charge and the potential outside 
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r = 0.25R 
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Figure 9.1. Surface charge density induced upon a grounded sphere of radius R by a point charge at 


PF 


the sphere vanish. On the other hand, if the point charge is outside the sphere the total 
induced charge is inversely proportional to the distance from the center of the sphere. 
In either case, the induced charge is concentrated nearby when the point charge is near 
the surface and is spread more uniformly as the distance between the point charge and 
the surface increases. Figure 9.1 show the angular distribution of induced surface charge 
density for several values of r’/R. 

Although it is probably easier to obtain the Green function for a single grounded sphere 
using the method of images than the expansion in spherical harmonics, the problem of two 
concentric spheres would be rather difficult to solve using images because an infinite set of 
images is required. The expansion in spherical harmonics, by contrast, is no more difficult 
for a spherical shell than for a sphere and contains the latter as a special case. The Green 
function method is much more versatile than the method of images. 


9.4.3 Example: Specified Potential on Surface of Empty Sphere 


Suppose that y == f[0, 6] is specified on the surface of an empty sphere of radius R. The 
interior potential is then determined by 


ss R? OGIT, 7’ : 
Wr] == $ flO, ae) dQ. (9.128) 
An Or =R 
where dS' = ¥ R? d£Y is the directed differential area and 
G[r,7’] = y gilr, r IX, 17 - Y, 177] (9.129) 
7 
, 4n rL r \2+1 


is the Green function for the interior of a sphere. The radial derivative is simply 


0g lr ri) __4ary 
Tat 2 E (9.131) 
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such that 


r< R= Mro = X (E) Hal. 01 | Yin 10,0]110,6]407) (9.132) 


Lm 


can be evaluated numerically for any surface potential. Similarly, the exterior potential is 
determined by 


4r r 1: Og lr rI) 4 (RYO 
y= Le = — SE ) 9.133 
glr, r'] 21+1 qn | (4) ( or’ ea =p? ( ) 
Then, using dS’ = -PR? d as the outward normal to the boundary of the exterior region, 
we obtain 


y 
r>r= vrea (5 Y, le, A in (OO | 410,6 e']a0') (9.134) 


Lm 


These results can now be combined in the form 


yir 0,41 =>, rhn Y, „l6, 6] (9.135) 


l,m 


where now r, is interpreted as the smaller and r, as the larger of r and R and where the 
multipole amplitudes are given by 


Yim = f Yin 10, P1F10, 6] dQ (9.136) 


The same results could have been obtained by writing the eigenfunction expansions for y 
in the interior and exterior regions and matching the boundary condition at the surface 
without using the Green function. (Try 1t!) 

Suppose that a point charge is found near a sphere of specified potential. According 
to the superposition principle, the electrostatic potential can be constructed by adding the 
potential for a point charge near a grounded sphere to the contribution by a sphere with 
specified surface potential in otherwise empty space. More generally, if there is a charge 
distribution near a sphere with specified potential, one adds the contribution for each ele- 
ment of charge, dq’ = p[]dV' for a grounded sphere to the contribution of the surface 
potential, as represented by the formula 


1 
j= f GU, IAV = $ YPI? GP, 7] a3" (9.137) 


where the appropriate Green function is used in the interior and exterior regions and where 
dS’ is radially outward for the interior or inward for the exterior regions. The surface 
contribution can be represented by a multipole expansion while the net contribution of the 
charge density and its corresponding induced surface charges can be represented either by 
multipoles or by images. 
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9.4.4 Example: Charged Ring at Center of Grounded Conducting Sphere 


Suppose that a charged ring of radius a is centered within a grounded conducting sphere 
of radius b. It is convenient to equation the 2 axis with the normal to the ring, such that 


q 
2ra? 


pl] = —— lr - aló|Cos[9]] (9.138) 


represents the charge density. The electrostatic potential is determined by 


wil = a Gh, 7]ol?]4v' (9.139) 
with 
G5,7]=>)' glo, WAL YF (9.140) 
1 
4 y 21+1 
alt= 7% (1-(3) (9.141) 


Azimuthal symmetry with respect to ø’ limits the expansion to terms with m = 0, for 
which 


21 1 1/2 
Y, oL0, | = ) P/[Cos[9]] (9.142) 
Therefore, we obtain 
pl 21+1 
w=) 55(1-(2) } ror cost eee) 
1 > 


where r, is the smaller and r, is the larger of r and a. Although P,[0] can be evaluated in 
closed form, the series is not simplified thereby. Nevertheless, the potential can be evalu- 
ated numerically using a simple program; however, it is often necessary to include many 
terms in order to achieve the desired accuracy. An example is provided in Fig. 9.2 for a/b 
and summation up to/ < 50. 


9.5 Magnetic Field of Current Loop 


The current density for a circular loop of radius a in the xy plane can be expressed as 


y 


Iso (9.144) 
ó[r — a] 


J, = 16[Cos[8]] (9.145) 


where the azimuthal unit vector can be expressed in either Cartesian or spherical bases as 


$ = —Sin[¢]% + Cos[¢]¥ = eto, + edo) (9.146) 


+í 
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Figure 9.2. Equipotentials for a charged ring of radius a centered within a grounded sphere of radius 
b, where a/b = 0.5. 


where 
5 x y a GLa y > 
e6, == ——= 9 e_= —=, é0 = €, (9.147) 


The multipole expansion of the magnetic vector potential 


Al] = f dv’ J] 
r-r] 


1 é (9.148) 
J “iml, a f ao arrei PI Yes 4] 


1 00 


then becomes 


00 


Al] = =$, 


D Y, 10, 6] | dg’ rn [79] erre) (9.149) 


1 = 2 m=-1 


where r_ is the smaller and r, is the larger of r and a. By writing 


Yin 10, 61 = Y, mlO, Ole"? = [ Y'n 10, p]e+? = 2700 y, 4.1 Y mF, O] (9.150) 
we obtain 
apy = la 4n r, i 


Ra 
(v.16. 01%,,[5.o]ee, +Y,,10,01%,, [5,0 ete) (9.151) 
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Figure 9.3. A, for current loop. 


Next we use 


Yom = (Ym Yinl@, 0] = Y, mlO, 0] 
to obtain 
N la 4r r T 
AglFl = 2 D 37,7 707 %a16, 01%, 5, 0] 
1=0 > 


Finally, we use 


Y E o] da p= 
Mm =, = P m -1/2 n 1 
2 ae (Se) E id 
and 
Aril + y? 
Y, [0,0] = -( Be, di 
to obtain 


ñ la A (-)" Qn—1)!! r2! 
Agl?] =21 2 REM pra Pons1.[CostAl] 
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(9.152) 


(9.153) 


(9.154) 


(9.155) 


(9.156) 


in terms of associated Legendre polynomials. The appearance of associated Legendre 
polynomials in the vector potential, instead of ordinary Legendre polynomials in the scalar 
potential for a ring, reflects the different symmetry properties of vector and scalar fields. 


Contours of A, are sketched in Fig. 9.3. 
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The magnetic field is obtained using 


mn a 1 Ds 10 
The angular derivative can also be expressed as 
pa = g 1 94 = 9 2\1/2 
5 00 a > sinj ag Sim) = ¿(0 — "7A gla) (9.158) 


Ox 


where x = Cos[6]. Expressing the associated Legendre polynomial as 


2 ô 
P bd = (1-2)? A = = ((1 = 2°)! P) = (1%) P- 2P 


Ox 
(9.159) 
and using the Legendre differential equation, we find 
1 O04. 
Sino] 38 (Sin[9]P, ,[Cos[4]]) = 77 + 1)P,[ Cos[6]| (9.160) 
and obtain 
Ta A (2 Qn + 1)!! rt! 
B, = 2n— 12 al a Ps +1 [Cos[0]] (9.161) 


The B, component is easily evaluated but must be separated into inner and outer regions, 
whereby 


la A (y 2n — DH rl 
r <a = By = 4n— a ae Psn+1,11Cos[8]] (9.162) 


n=0 
la A (Y (2n + 1)!! a2"! 

n ! 2Qn+2° 2n+1,1 
cr 2” (n+D! r 


[Cos[8]] (9.163) 


9.6 Inhomogeneous Wave Equation 
9.6.1 Spatial Representation of Time-Independent Green Function 


Consider an inhomogeneous scalar wave equation of the form 
Le V | wit, t] == 4aplr, t] (9.164) 
a r,t] == 4zp[r, ; 
e ôt? a 

where w represents the wave amplitude, c is the phase velocity, and p represents a source 


that we assume is localized in both space and time. It is useful to perform a Fourier analysis 
of the time dependence 


plr, t] = f So ball (9.165) 
œ d . 
wir, 1] = f S eP] (9.166) 
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such that the spatial dependence is described by the inhomogeneous Helmholtz equation 
(V? + P) yP] = 4p, [7] (9.167) 


where k = w/c is the wave number. One could, of course, perform a Fourier analysis of 
the spatial dependence also, but we prefer to begin with a more traditional partial-wave 
expansion of the Green function, which satisfies an equation of the form 


V?G + KG == -4n6[7 - 7] (9.168) 


with boundary conditions to be specified later. Once we have the Green function, the solu- 
tion to the original wave equation becomes 


M= A+ [array (9.169) 
where ¢, is a solution to the homogeneous equation 
(V? + KGP] = (9.170) 


that is determined by matching the boundary conditions. 

For example, in the time-independent formalism for scattering by a localized distribu- 
tion, one specifies that the asymptotic wave function takes the form of an incident plane 
wave plus an outgoing spherical wave of the form 


Exp[zk’r] 
r 


r> 1 => WF] ~ Explik-7] + fIR', K] (9.171) 


where the scattering amplitude f is the amplitude of the spherical wave. The incident wave 
vector k = ek specifies both the frequency w and direction k for the incident wave while 
the direction of the outgoing wave is specified by k’ = 7. The magnitude of the scattered 
and incident wave vectors are equal for elastic scattering, such that k’ = k = w/c, but 
differ for inelastic scattering. If we choose the 2 axis along the incident direction, we may 
write the asymptotic wave function for elastic scattering as 


r >r => 4,15] = Explikz] + f,[9, go (9.172) 


F 
Including the time dependence, the planes of constant phase in the incident component 
p[r, 1] = Exp[i(kz — wt)] (9.173) 
are clearly seen to travel in the direction of increasing z with phase velocity c. Similarly, 
spheres of constant phase in the scattered wave 
Expli(kr — wt)] 
r 


Y [7 t] oc 


move radially outward with the same phase velocity. Thus, outgoing boundary conditions 
require 


(9.174) 


yk Explikr 
is pjs A (9.175) 
qe 
However, a more rigorous treatment would represent the incoming wave by a localized 


wave packet of finite extent, rather than a plane wave. 
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Recognizing that k > 0 = G > [Fr — 7’|-!, we seek a solution of the form 
_ SIR] 
R 


where R = |r—7’|. It is convenient to temporarily shift the coordinate system to make 7’ >0 
and to employ 


G (9.176) 


VIGS) = fVg+ eV +2Vf Vg = VË = “Vg = 5 “8 — 4n6[?] (9.177) 
Using the spherical symmetry of g[r] to write 

V*slr] = E ar) = rg" [r] + 2g'[r] (9.178) 
the Helmholtz equation becomes 

8" [r] + gtr] = -4701 — glrDolr] (9.179) 


Therefore, by requiring g[0] == 1 we immediately obtain g = aExplikr] + bExp[-—ikr] 
with a + b = 1. Outgoing boundary conditions require G œ e®/R for R > oo, such that 
Da Exp[iklr — 7 
Cpe (9.180) 
Ir] 
represents an outgoing spherical wave produced by a point source at 7’. Similarly, the 
Green function for incoming boundary conditions becomes 


: Exp[-ik? - ” 
a prj AL ari (9.181) 


- 7] 
It is also useful to express these functions in terms of spherical Hankel functions 


+ Exp[+iklr — 7’ + 
Gi [7,7] = at = +ikho kt 71 (9.182) 
T-T 
where only / = 0 is needed because the Green function is spherically symmetric with 
respect to the distance from the source. 

Suppose that p,, represents a localized source, such as an antenna, that radiates out- 
going waves and that there is no incident wave. We can then drop the ¢, contribution and 
employ the Green function for outgoing boundary conditions, such that the solution to the 
inhomogeneous Helmholtz equation takes the form 

z Exp[iklr-7'|] ., y 
jll = f EY O O (9.183) 
r= 
In the far field, where r is much greater than any r’ for which p,, has appreciable strength, 
we may use 
Per Exp[iklr — 7 Exp[ik. > 

PsP => PoP IA rhi ier ) = Ppr A 18h) 

f 


Ir—7] r 


where k = k? to approximate the Green function. Note that 1t is important to include the 
dependence of the phase of the exponential upon r’ because cancellations between various 
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parts of the source are crucial to the amplitude and angular distribution of the scattered 
wave, but the variation of the denominator is much less important and can be neglected for 
a localized source. Thus, in the far field we obtain an asymptotic approximation 


y a = Explikr] 
r>r => plr] = FUI (9.185) 
that has the form of an outgoing spherical wave whose angular distribution 
fik] = J Exp[-ik -7]p, [7] dV’ (9.186) 


is governed by the Fourier transform of the source. The quantity f [k] is known as the 
form factor. Similar form factors appear throughout theories of radiation or scattering. The 
interpretation of the form factor should now be clear. When the field point is sufficiently 
far from the source to treat the rays received from various parts of the source as parallel, 
the total amplitude is given by the sum of amplitudes from each part of the source with a 
phase, relative to the center, given by =k?’ 

It is often useful to expand the form factor in terms of multipoles of the source. Expand- 
ing the plane wave in spherical harmonics 


Exp[-ik ‘l= y) i” j [kr 11, [k] - Y, [FP] (9.187) 
7 
the angular distribution can be expressed in the form 
fl] =>, 0,14] = £16, 61 = > Ci nL, nl 8, 6] (9.188) 
1 Lm 


where the coefficients 
Cimtk] = AA dV (9.189) 


are Fourier—Bessel multipole amplitudes for the source. 


9.6.2 Partial-Wave Expansion 


It will often be useful to have a partial-wave expansion of the Helmholtz Green function. 
Let 


GAR FT = J afr WAL YF (9.190) 
1 


ó[r — r'] 


rr’ 


él? —?’] = SOY YF] (9.191) 
1 


where the boundary conditions will be specified later and where the k dependence in g, is 
left implicit. Using 


19 (rgi) 10+D 
V?g,[r1Y,, 10, a=( i l- 2 


air Y, mlO, $] (9.192) 
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the radial equation becomes 
%g,[r, r al 2 dg [r, r’] 4 e- d+ =| ls gpI =r] 
ðr? r Or rr 
The homogeneous equation is recognized as the spherical Bessel equation. If we require 


g, to be regular for r < r’ and to obey outgoing boundary conditions for r > r’, then we 
expect g, to take the form 


glr, r']=Aj [kr] 05? [kr] (9.194) 


(9.193) 


where A is a constant, j, is the regular spherical Bessel function, and n” is the outgoing 
spherical Hankel function. The constant is determined by the discontinuity in slope 


0g,[r, r'] = 4r Ohi [kr] (+) 9 j,[kr] An 
a(S ar > a a(i- MOKS (9.195) 


where the term in the final parentheses is the Wronskian between the solutions in the inner 
and outer regions. Using 


an? d w n 
Ox 


hn” [x] = brinda = ja] -h [x] = jen; [x] -allj I] (9.196) 
and the Wronskian 

jbga; x] - 1 [x] 7; [4] = 5 = A =Anik (9.197) 
we finally obtain the partial-wave expansion 

GPF, |= Exp[iklr — r'|] “A 


-r| 


rik Y ¡Mer Vn” Ikr 10,51 YF] (9.198) 
1 


The asymptotic form of the wave function for r >> r’, where r’ is limited by the source p, 
can now be written as 


r>r5r => 4, [1] = Ol? + ik hr” [kr] AA kr’ YinlF lou 1dV" (9.199) 
l,m 


The angular dependence of the scattered wave is coupled to the angular properties of its 
source. Using the asymptotic behavior of the Hankel function 


n = fl 
Pee = (9.200) 
kr 
we find 
Yal] = pll + E == Ci nlKIY, LF (9.201) 
l,m 
Cim = ef j¡[kr"YY,,, 17 Jp, [71 av’ (9.202) 


Thus, the scattered wave is expanded in outgoing waves with definite orbital angular 
momentum whose amplitudes are determined by a multipole expansion of the Fourier— 
Bessel transform of the source. 
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9.6.3 Momentum Representation of Time-Independent Green Function 


Another useful representation of the Helmholtz Green function is obtained by expanding 


GIP, 7] = f ae BIRK, T lolk', 7] (9.203) 
Qry 
where ok, T] is an eigenfunction of the homogeneous equation 
(V? + POLK, 7] = (9.204) 
and where 
(V? + GIP, 7] == —47ró[F - 7] (9.205) 


with the appropriate boundary conditions. Using olk, rj = Explik - T] and applying the 
differential operator V? + k? we obtain 
Pr 


= (K — k)B[K’, 7] Exp[ik’ - 7] 


(V? + 12)G[F, 7] = a 
(9.206) 


Pr > 
= -4n Exp[ik’ -(F- 7’ 
f On plik’ -( )] 
where in the last step we employ the Fourier representation of the delta function. Using 


the orthogonality of plane waves, we deduce 


Exp[-ik’ -7] Bk’ Explik’ -(+-7)] 


K,7] =4n => Gli,7"] = 47 9.207 
BL r ] k? k2 Ņ r ] (Qry k? = ke ( ) 
The angular integral can be done by expanding the plane wave, such that 
ZA [kR 2 p” Sin[k’R 
GIP, ?]= 2 f dx? Jol a f dk zal i 
m Jo k-k TR Jo k* -—k 
da 135) (9.208) 
1 ,,, EXplik'R] 
IMR Jo k-k 


where R = |r — 7|. Ordinarily we would exploit the symmetry of the integrand to replace 
the sine function by an exponential and to extend the range of integration with respect 
to k’ to +oo so that we can use a great semicircular contour closed in the upper half- 
plane, but the integrand has singularities at k’ = +k that are on the contour. Fortunately, 
we still have some unused information, namely the boundary conditions, which can be 
used to determine the proper handling of these singularities. Imposing outgoing boundary 
conditions in the far field, R > oo, suggests that we should include the positive pole and 
exclude the negative pole. With this choice we obtain the same outgoing Green function 


_ Explikl? 7] 


P7] 


GP P, P] (9.209) 
as before. Alternatively, incoming boundary conditions are satisfied by including the neg- 
ative and excluding the positive pole. The appropriate contours for these conditions are 


sketched in Fig. 9.4. 
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Figure 9.4. left: outgoing bc, right: incoming bc. 
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Figure 9.5. left: outgoing bc, right: incoming bc. 


Another method for achieving the same result is to modify the Helmholtz equation by 
adding an infinitesimal 6 


k> ktit E = ksi? (9.210) 


such that the poles at k’, are shifted off the real axis asymmetrically, as sketched in Fig. 9.5. 
If 6 > 0*, we obtain outgoing boundary conditions while if 6 > 07 we obtain incoming 
boundary conditions. Therefore, the momentum representation of the Green function can 
be expressed as 


—4r 


Gk, k] = == 
[kk] k-k? +16 


(9.211) 


where 6 is a positive infinitesimal and where the spatial representation is obtained using 
the Fourier transform 

ak’ Exp[ik’ - (7-7) 
2m? K- k’? iô 


Gi [7,7] = -4r (9.212) 
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9.6.4 Retarded Green Function 


If one performs a Fourier analysis of both the time and spatial dependencies for 


Ëk (° dw gs i 
a ll a Pall (9.213) 
3 Ëk (“do irony è 
US l 50 R] (9.214) 


the inhomogeneous wave equation takes the form 


N ae a oS P 
(e — (2) Jatt =p, [k] = 0,10 = 25 (9.215) 
a k-e 


where, in this approach, k is not restricted to w/c. The spatial wave function is obtained 
by inverting the Fourier transform 


WI? t] = -c? 


3 co dis o [k 
f dw pilor) roma (9.216) 


OTP J- 20 -Re 


but care must be exercised in handling the singularity. It is simplest to consider the Green 
function for a point source in both space and time, which satisfies an equation of the form 


2 
(3 a3 - y) GÍr, 1,7, 1"] = 4n6[+ - 7P]6 [1 - 1] (9.217) 
C 


with boundary conditions to be specified later. Thus, using 


pl, t] = 4n6[7 - ?’]6[t — 1] = p, [k] = 47 Exp[-i(k -? — wt”)] (9.218) 
we find that 
Bk (° dw Explik-(F-7)-w(t-1 
Gnr iire [ LE f ES (9.219) 
(Qn) Joo 27 w —kc 
represents a wave produced by a point source at position 7” at time t’. 
The angular part of the integral can be performed easily using 
Explik - R] = any ij, RIVA] -Y,[R] (9.220) 
7 


where R = 7—7 and R = IRI, such that 


Ss 2c? pa 2 dw Exp[—iw(t - t’)] 
G[r, t;7’, t’] = — dkk? j [KR 9.221 
Bar aeaf SP (9.21) 


is limited by spherical symmetry to / = 0. The integrand exhibits poles on the real axis 
of the complex frequency plane at w = +kc. The integral does not have a unique value 
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Figure 9.6. Contours for Retarded Green Function. 


because the poles lie on the integration path, but we can appeal to causality to determine 
boundary conditions that specify the appropriate contour. Thus, we require G to vanish for 
t < ft’ such that causes precede effects. This is accomplished by employing a contour that 
is slightly above the real axis, such that w > w + ie where € is a positive infinitesimal, and 
closing in the contour in the upper half-plane for t < t’ or the lower half-plane for t > t’, 
as shown in Fig. 9.6. Alternatively, one could imagine shifting the poles slightly below the 
real axis, such that w, = +kc — ie and taking the limit e > 0* after evaluating the integral. 
Either way we obtain 


T dw Exp[-iw(t -t’)]  Sin[kc(t - t’)] 
oes = 0 
-œ 217 (w + ie) 


tt 9.222 
-ke kc | l ( ) 
where © is the unit step function. 
The retarded Green function then takes the form 


os 2c Oft —1' di 
GTR, t; ř,t'] = 7 Ol | 
T R 0 


dk Sin[kR] Sin[kc(t — t’)] 


cOol-$] pa. ; 
= ‘A Ak(Cos|k(R — c(t - 1’))| 


- Cos[k(R + c(t — 1’))]) (9.223) 
eorr] E : : 
a S dk(Explik(R — c(t — t'))]. 
— Explik(R + c(t - 1’))]) 
Therefore, we finally obtain a retarded Green function 
GOP, GY, r'] = ool =P | = c(t = )JO[: =f ] (9.224) 


r= 7"| 
describing a spherical delta-function shell propagating outwards with velocity c. The con- 
tribution that propagates inward for earlier times is suppressed by the step function. How- 
ever, for some problems it might be appropriate to consider either the advanced Green 
function, GO, or a standing wave obtained using different boundary conditions. 

The wave function for a variable source can now be computed using 


Melia f J a TA (9.225) 


FF 
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such that 


=, = FF] 
wir, t] = f Pinte | ay (9.226) 


P- 7] 


resembles the solution to the static Poisson equation except that the potential at position 7 
contributed by a source at 7’ is determined by an earlier, or retarded, time t — |F — 7’|/c as 
expected for a finite propagation velocity. 


9.6.5 Lippmann-Schwinger Equation 


The time-independent Schródinger equation 
h? 
(7 + via) v[r] == Ev[r] (9.227) 


for positive energies E = f?k?/2m can be expressed in a form 
(V? + Pir] = viy] (9.228) 


that strongly resembles the inhomogeneous Helmholtz equation except that the source term 
is proportional to the wave function itself. Here we define v = 2mV/h* for convenience. 
Thus, a formal solution can be expressed in the form of the Lippmann—Schwinger equation 


n= om [MIA (9.229) 
where ¢ is a solution to the homogeneous equation 

(V? + Ol] = (9.230) 
and where the Green function satisfies 

(V? + 2)G[5,7] = fF - 7] (9.231) 
and with outgoing boundary conditions becomes 
2, Exp[iklr — 71] 
4n i-r] 


(9.232) 


Note that one must always be aware that the normalization of the Green function depends 
upon the form chosen for its point source and tends to vary with the application. 

With the unknown wavefunction y appearing on both sides of this integral equation, 
one may question whether anything has been accomplished. Nevertheless, the Lippmann- 
Schwinger equation has proven to be very valuable in developing approximations and 
analyzing the general properties of scattering solutions. For example, if the potential is 
sufficiently weak, we can assume that its effect is relatively small and use 


Y p => Ul © ell + f GU. FIVE 1a" (9.233) 
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to obtain the Born approximation. This approximation can be improved by substituting the 
first-order approximation back into the Lippmann—Schwinger equation to obtain 


Wer | ¿rar FW 
+ f Br dr” GE, PIV EIGE, PIVE oR] +e (9.234) 
Although it is often difficult to prove that the series converges, iteration of this successive- 
approximation procedure provides a multiple-scattering series that is formally exact. The 


notation for this series can be simplified considerably by using an operator representation 
of the Lippmann-Schwinger equation 


Y == 6+ Gw (9.235) 
Next we define the transition operator T by the operator equation 

To = vy (9.236) 
and multiply both sides of the Lippmann-Schwinger equation by v, such that 

vy == VO + vVGvw => To = (v + vVGT)W (9.237) 
and conclude that the transition operator satisfies 

T =v+vGT => T = v + vGv + vGyGy + +** (9.238) 


For elastic scattering by a short-ranged potential, one naturally chooses a plane wave for @ 
and uses the outgoing Green function such that 


m f Exp[ikl? — 7] 


7] = Explik-7] — 
ii E FF] 


Vir wir lar (9.239) 


For distances much larger than the size of the source, we can approximate the phase of the 
Green function using 
PY Exp[iklr — 7°’ ExplLik > 
i ot eats A A 0) 
r r 


P-P] 


where k’ = k? is the wave vector for the scattered wave. Note that we must treat the phase 
carefully because it varies rapidly over the size of the source, but need not be overly con- 
cerned with the denominator because the magnitude of the Green function varies slowly 
when r > r’. Thus, the asymptotic wavefunction takes the form 


Exp[ikr] 
r 


r> 1 = Ul?) = Explik-7] + fIK', k] eat) 


where the scattering amplitude is identified as 


fl. È =- f Exp[-ik’ -P]V [514177 dir (9.242) 
21h" 
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Although this result is exact, it is not especially useful without a suitable approximation 
for y. Provided that the potential is short-ranged and sufficiently weak, we can use the 
Born approximation to replace y with ¢ and obtain 


na | Explig- VF] Pr (9.243) 


where q = k — K' is the momentum transfer to the target. 


Problems for Chapter 9 


1. Between the sheets 
Two infinite parallel conducting planes at z = 0, L are held at y == 0. 


a) Using cylindrical coordinates, Y = (é, $, z), demonstrate that the Dirichlet Green func- 
tion takes the form 


G[r, 7] = 


MES 


m=-00 n=1 


a [sot - #1 >) Sinkk, 2] Sin[K, 2,18, EVK 4,1] 


(9.244) 


where k, = T and where /,, and K „ are modified Bessel functions of the first and second 


kinds, respectively. 


b) Show that this Green function can also be expressed in the form 


Sinh[kz_] Sinh[k(L — A 


OP 1=2 Y) (Eolimg -on f aks, ME METE 


m=—oo 


(9.245) 


c) Use either of these representations to evaluate the electrostatic potential produced by a 
point charge on the z axis at height h between grounded planes. 


d) Find expressions for the surface charge densities, 0, and o}, on the conducting planes 
and compute the total charge on each. 


2. Polar caps 

Suppose that a portion of a sphere of radius R contained within O < 6), described as a north 
polar cap, is maintained at potential +V, and that the corresponding south polar cap with 
0 > 7 — 0, is held at potential —V, while the remainder of the sphere is grounded. 


a) Develop an expansion for the electrostatic potential within the sphere. The coefficients 
may involve Legendre polynomials with fixed argument. Which terms contribute? 


b) Write the corresponding expansion for r > R. Describe the asymptotic behavior of the 
potential for r >> R and find an explicit expression for the effective dipole moment. 
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3. Dirichlet Green function for two-dimensional semicircle 
Consider a two-dimensional semicircular region defined by (0 < ë < 4,0 < $ < 7). Use 
the eigenfunctions of 


(V? + Anm) WEF] =0, Yla, 6] = WIE, 0] = WIE, 7] = 0 (9.246) 
to expand the two-dimensional Dirichlet Green function for the Poisson equation 
V?G[F, 7] = —4mó[* - 7] (9.247) 


Although numerical values for the eigenvalues are not required, you must provide the 
equations that determine them. 


4. Piece of pie 

Suppose that an electrode is shaped like a piece of pie — a wedge of radius a and opening 
angle a. Evaluate the two-dimensional Dirichlet Green function using separation of vari- 
ables in polar coordinates, (€, ). 


5. Cylinder with grounded endcaps 
An empty cylinder of radius a with its axis along the 2 axis has grounded endcaps (y ==0) 
at z = 0 and z = L while its curved surface is held at potential W[a, ¢, z] == VILO, z]. 


a) Develop an expansion for the electrostatic potential W[€, 6, z] within the cylinder and 
express the coefficients in terms of the appropriate integral over V[d, z]. 


b) Determine the coefficients for the simple case V[@, z] = V,(1 — 20[4 — z]) where © is 
the unit step function. 


6. Cylinder with opposite potentials on its endcaps 
The curved surface of a cylinder of radius a is grounded while the endcaps at z = +L/2 
are maintained at opposite potentials W[é, ¢, +L/2] = +VIE, @]. 


a) Develop an expansion for the electrostatic potential W[€, 6, z] within the cylinder and 
express the coefficients in terms of the appropriate integral over V[€, ¢]. 


b) Determine the coefficients for the simple case V[E, 6] = V, where V, is constant. 


7. Cylinder with diametrically opposed electrodes 

Suppose that a long conducting cylinder of radius a has edge electrodes at potentials y = V 
for |6| < a and y = —V for |0 — z| < a. The remaining portions of the cylinder are held at 
Y = 0. Evaluate the electrostatic potential in both interior and exterior regions. 


8. Split-sphere acoustic antenna 
Sound waves produced by a split-sphere antenna with radius R satisfy 


Py 


1 Ae 0<0<35 
e ye 


—Ae 0! Z <O<T 


- Vy = 0, ragni (9.248) 


Evaluate y[r, t] for r > R with outgoing boundary conditions. 
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9. Acoustical waves from vibrating cylinder 
The surface of an infinite cylinder of radius R vibrates harmonically such that the pressure 
at the surface is given by 


WIR, 0,1] = f[O]e (9.249) 
where the angular function f[0] is prescribed. The pressure for r > R satisfies the wave 
equation 


(> 2 y) di (9.250) 
E 


with outgoing boundary conditions. Construct a formal solution for the field. 


10. Acoustical wave guides 
Density displacements w[7, t] for a gas in a confined volume satisfy a wave equation of the 
form 


55 -VWy=0 =0 (9.251) 


where c is the sound velocity and 0y/0n = ii - Vy is the normal derivative at a boundary 
surface. 


a) Evaluate the normal modes for a long rectangular wave guide with cross-sectional 
dimensions a X b. Show that for most modes there is a minimum frequency for trans- 
mission. Compare the phase and group velocities. 


b) Evaluate the normal modes for a long cylindrical wave guide with cross-sectional radi- 
us R. Show that for most modes there is a minimum frequency for transmission. Com- 
pare the phase and group velocities. 


11. Acoustic modes 
Sound waves in a confined volume satisfy a wave equation of the form 


55 -Vw=0 == (9.252) 


where c is the sound velocity and 0y/Ón = ñ - Vy is the normal derivative at a boundary 
surface. The normal modes take the form 


yil, t] = y [He o* (9.253) 


where the eigenfrequencies w, may be degenerate, requiring additional indices to distin- 
guish between degenerate modes. 


a) Evaluate the normal modes and eigenfrequencies in a box with dimensions ax b x c. 


b) Evaluate the normal modes and eigenfrequencies for a sphere of radius R. 
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c) Evaluate the normal modes and eigenfrequencies for a cylinder of radius R and length L. 


12. Organ pipe 
The pressure fluctuation y in an organ pipe of length L satisfies the wave equation 


1 Pwlx, t] B yix, t] E 
e $e aeo 


-0 (9.254) 


where c is the speed of sound. The end at x = 0 is open while the end at x = L is partly 
closed such that the boundary conditions are 
ow 


W[0, t] == 0, (v + oF] a 0 (9.255) 


where « is a constant. 


a) Construct the normal modes and derive the equation that determines the characteristic 
frequencies. Describe a graphical solution to this equation. 


b) Demonstrate that the normal modes are orthogonal. 


c) Suppose that the pressure in the pipe is in equilibrium for tf < O and that constant 
pressure y, is applied to the open end for £ = 0. Find a series representation for w[x, t]. 


13. Vibrations of a membrane wedge 

Suppose that a taut membrane is stretched across the area described by a < & < b, 
0, < 0 < @,. Develop the vibrational eigenfunctions and the condition satisfied by the 
eigenfrequencies. Demonstrate that the eigenfunctions are orthogonal and provide a for- 
mal expansion for the initial-value problem (specified initial displacement Y[É, 6, t = 0]). 
It is not necessary to obtain an explicit formula for the radial normalization integral. 


14. Critical mass 
Suppose that the neutron density w in a fissionable material satisfies an inhomogeneous 
diffusion equation of the form 


OY a (9.256) 


where x is the diffusion constant and A depends upon the fission probability per neutron. 
For simplicity assume that y = 0 on the surface of the material (neutrons escape rapidly at 
the surface or are absorbed by the surrounding medium). 


a) Suppose that the material forms a sphere of radius R. Determine the critical radius R,, 
beyond which the neutron density is unstable (exponentially increasing) and produces 
an explosion. 


b) Determine the critical radius R,, for a hemisphere of the same material. If two hemi- 
spheres that are barely stable are brought together as a sphere, the final configuration 
will be unstable. Express the explosive time constant T, in terms of k and A such that 
Y ~ en. 
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15. Heating simple solids 

Suppose that a simple solid (brick, sphere, cylinder, etc.) with uniform initial temperature 

is immersed at time £ = 0 in a heat bath. The temperature w[7, t] within the material 

satisfies 
1 Ow 


= 2 Tr = R > = 
cae PM. Mea (9.257) 


where R is on the surface. 


a) Determine the temperature distribution for positive times within a brick with 0 < x < a, 
O<y<b,and0 <z<c. 

b) Determine the temperature distribution for positive times within a sphere with r < R. 
What is the asymptotic time dependence of the central temperature? 

c) Determine the temperature distribution for positive times within a cylinder with € < R 
and 0 < z < L. 


16. Cooling sphere 
A solid sphere of radius R has a spherically symmetric initial temperature distribution 
WoLr] and cools at its surface according to Newton’s law of cooling 
ð 
rR yt aR = (9.258) 
F 

If one can neglect the radial variation of the diffusion constant, this problem provides a 
simple model for the cooling of a planet. 


a) Develop a series representation for Wr, t] and express the coefficients in terms of Wo. 
b) Evaluate the special case where y, is constant and a is negligible. Plot the temperature 


dependence for several radii. 


17. Diffusion in a cylinder with constant surface temperature 
A long solid cylinder of radius a has initial temperature distribution y,[é, 6] and constant 
surface temperature y[a, 6] = 0. The temperature satisfies the diffusion equation 
1 Ow 
K Ot 


where the variation with height is ignored. Develop a series expansion for y[é, ¢, t] and 
express the coefficients in terms of Wo. 


= Vw (9.259) 


18. Surface waves on ideal fluid 

The velocity field v = Vy for the flow of an ideal fluid can be expressed in terms of 
a velocity potential y that satisfies Laplace’s equation, V7 == 0, subject to appropriate 
boundary conditions. Suppose that a fluid with equilibrium depth h is essentially infinite 
in two dimensions, x and y, but has a flat floor. The boundary conditions are then 


au 1y) ay 
(2 ij ih, dl l = 7 a 
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a) Find a general expression y,,[x, y, z, t] for a plane wave with frequency w that propagates 
in the x direction. 

b) Determine and sketch the phase and group velocities as functions of kh, where k is the 
wave number. 

19. Standing waves on ideal fluid in rectangular tank 

The velocity field v = Vy for the flow of an ideal fluid can be expressed in terms of 

a velocity potential y that satisfies Laplace’s equation, V?y == 0, subject to appropriate 

boundary conditions. Suppose that a fluid with equilibrium depth / is in a rectangular tank 

with dimensions a X b and a flat floor. The boundary conditions are then 


a) Construct standing-wave solutions y, [x, y, z, t] and determine their frequencies. 


nm 


b) Construct general solutions that satisfy the initial condition 
v¿Lx, y, 0, 0] = vol% y] (9.262) 


20. Seasonal variation of ground temperature 
In the flat-earth approximation, the temperature y at depth z can be described by a one- 
dimensional diffusion equation of the form 


== 2 (9.263) 


where x is the thermal diffusion constant. Suppose that the external temperature can be 
approximated by a sinusoidal variation of the form 
YIO, t] = Y, + Yi Sin[wt] (9.264) 
where w = 27/T for period T. 
a) Solve for w[z, t] and determine the penetration depth d and the phase delay for propa- 
gation of thermal waves into the ground. 


b) The penetration depth for annual variations is approximately 3 m. Determine the phase 
delay for annual variations. Also determine the corresponding penetration depth and 
phase delay for daily variations. Discuss the separability of these periods. 


c) Obtain Y[z, t] for combined annual and diurnal variations. 
21. Current distribution in wire 


One can show that the current density J satisfies an equation of the form 


(¥ See ); = 0) (9.265) 


when the permittivity e, permeability u, and conductivity o are constant. The radial distri- 
bution of current flowing with sinusoidal time dependence along a long straight wire with 
circular cross-section of radius a can be represented as 


JIE, t] = Re[Wlé] Exp[-iwr]]| (9.266) 
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a) Show that [E] is related to a Bessel function with complex argument and normalize the 
solution to total current J. 

b) Express the solution for a good conductor with 420°/ew >> 1 as a function of y a where 
y= Ni Aro uw/c?. Plot and discuss MAM yla]? for both large and small values of ya. 

c) Evaluate the impedance Z = V/I where V is the voltage drop over a length / measured 
at the surface of the wire. The impedance can be expressed in the form Z = R — iwL 
where R is the resistance and L is the inductance for frequency w. Evaluate R and L for a 
good conductor in both small and large w limits. Sketch and interpret R[w] and L[w] for 
a good conductor. It is useful to express these quantities in terms of the skin thickness 
6 = V2/y. 

d) Express the solution for a good conductor in terms of the Kelvin functions, ber,[x] and 
bei, [x], defined by 


Je 4x] = ber, [x] + ibei, [x] (9.267) 


for positive real x where J, is the regular Bessel function of order v. (Hint: compare the 
differential equations for J,[e*'”/4x] with the present equation.) This representation is 
sometimes seen in electrical engineering. 


22. Evolution of line vortex 
The vorticity distribution w[x, y, t] in a viscous fluid satisfies a diffusion equation of the 


form 
ve == Vw (9.268) 


where the kinematic viscosity v is a positive constant. Suppose that at ¢ = O there is a line 
vortex 


olx, y, 0] = Trôlx]ó[y] (9.269) 


where I is constant and assume that w > 0 when r > oo. Determine the subsequent 
behavior of w using each of the following methods. 

a) Use a Laplace transform with respect to time and look up the required inverse transform. 
b) Use a two-dimensional Fourier transform with respect to position. 


c) Assume a similarity solution of the form w = E f [5] and determine f. (Hint: examine 
the behavior of the differential equation for f[£] to show that the substitution f[€] = 
g[é]e~*/4 provides an integrable equation for g[£].) 


23. Scattering by separable s-wave potential 
Often the interaction between two particles is nonlocal, for which the time-independent 
Schródinger equation takes the form 


(K +v2)y |È P] -2u I VRP Wk] dr = 0 (9.270) 


where k is the wave number in the center of mass and y is the reduced mass. 
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a) Develop an integral equation for scattering solutions and identify the scattering ampli- 
tude. 


b) A separable s-wave potential is defined by 
2uV [7,7] = wiri [r] (9.271) 


where A is a strength parameter and v[r] depends only upon the distance r = |r|. Obtain 
an exact solution for the scattering amplitude in terms of v[k], where 


v[k] = IN v[r] jp [kr]? dr (9.272) 
0 


Under what conditions does the Born series converge? (Hint: solve first for the quantity 
Alk] = f virlv[k,7] dr.) 


c) Evaluate the scattering amplitude for the specific case of a Yamaguchi potential with 


V[k] = x 
a= w 


So (9.273) 


where A is a positive range parameter. Express the phase of the scattering amplitude in 
terms of the dimensionless parameters y = —27A A and K = k/A. Under what conditions 
does the potential support a bound state? For neutron—proton scattering there is a low- 
energy bound state in the 3S, channel but the potential for the 1S, channel is not quite 
strong enough to produce a bound state. Compare the phase shifts for these channels as 
functions of x assuming that y is either 5% stronger or 5% weaker than the minimum 
strength for a bound state. (Note: for the phase shift it is best to use the quadrant- 
sensitive version of the inverse tangent function.) 


d) Express k Cot| do Lk], where 6,[k] is the phase shift, in terms of ¥[k]. Obtain general 
expressions for the scattering length and effective range assuming that Y[k] can be 
expanded in even powers of k. Then evaluate the scattering length and effective range 
for the specific case of a Yamaguchi potential. Describe the behavior of the scattering 
length for y near the critical value for a bound state. 


24. Scattering by a soft cylinder 

Plane waves with wave number k are incident upon an infinitely long cylinder of radius R 
with the direction of propagation perpendicular to the axis of the cylinder. The surface 
of the cylinder is soft, such that the total wave (incident plus scattered) vanishes at R. 
Construct a formal expansion for the scattered field and then examine the leading term in 
the long-wavelength limit KR < 1. 


10 Group Theory 


Abstract. Group theory provides powerful methods for analyzing the consequences 
of symmetries. We begin with the basic concepts for finite groups and their matrix 
representations, including orthogonality theorems and methods for constructing and 
analyzing irreducible representations. These concepts are then extended to continu- 
ous Lie groups. The irreducible representations of the quantum mechanical rotation 
group and their direct products are analyzed in detail. Finally, the application of uni- 
tary groups to particle theory is discussed briefly. 


10.1 Introduction 


Many of the general features of the dynamics of physical systems are determined or 
largely constrained by their symmetries, and symmetries are often associated with con- 
served quantities. For example, momentum conservation is a consequence of translational 
symmetry, angular momentum conservation originates in rotational symmetry, and parity 
embodies reflection symmetry. By classifying the states of a system according to irreduc- 
ible representations of its symmetry group, one can deduce general selection rules that 
forbid certain transitions or can deduce patterns among the allowed transitions. These and 
other implications of symmetries can be studied using the mathematical theory of groups 
in which the symmetry operators are treated as elements of an algebraic system. For exam- 
ple, the symmetries of an equilateral triangle in a plane consist of an identity element, two 
rotations, and three reflections. These operators are described as elements of a group that 
obeys algebraic rules similar to those of ordinary algebra, except that the group elements 
are more abstract than the natural numbers. Thus, one can often deduce important prop- 
erties of the physical system by applying the theorems of group theory to its symmetry 
operators instead of by solving the differential equations that describe its dynamics. The 
early development of group theory was largely motivated by crystallography, but its most 
important applications are now in quantum mechanics, field theory, and particle physics. 
The symmetries of global gauge transformations determine the conserved quantities while 
the symmetries of local gauge transformations of quantum fields determine, up to coupling 
constants, the types of interactions among the particles represented by those fields. In fact, 
so important has group theory become to particle theory that an early theory of grand uni- 
fication using supersymmetry is generally referenced by its symmetry group, SU(5), rather 
than by either its dynamical principles or its authors’ names (Georgi and Glashow). 

The theory of groups is a very large and highly developed subject in both mathematics 
and physics. A comprehensive treatment would require at least one semester and a dedi- 
cated textbook. Nevertheless, this topic is sufficiently important to modern physics that we 
include a synopsis even though it diverges from the main topic of this book. We begin with 
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the theory of finite groups and their matrix representations. We include proofs of most of 
the theorems; these proofs presume familiarity with linear algebra. Our primary examples 
study small amplitude vibrations of highly symmetric configurations of masses. We then 
generalize these results to continuous groups, especially Lie groups, but do not always 
provide proofs — many of the results are analogous to those for finite groups but the proofs 
are often more technical. We discuss several types of representations, including matri- 
ces, differential operators, and eigenfunction multiplets, emphasizing that general results 
depend upon the abstract structure of the associated Lie algebra instead of the particular 
representation at hand. These techniques are then used to analyze the quantum mechanical 
rotation group in considerable detail. Finally, we give a brief discussion of the application 
of unitary groups in particle physics. 


10.2 Finite Groups 
10.2.1 Definitions 


Consider a finite set of objects, S, = {a i = 1, N}. A transformation M is a one-to-one 
mapping of S, > S, described by Ma, = a,. For example, the permutation 


Ma, =a, Ma,=a;, Ma,;=a, (10.1) 


is one example of a one-to-one mapping of {a,, a,, az) upon itself. A set of such transfor- 
mations, G = (M,, i = 1, m}, is described as a group with respect to a law of composition, 
figuratively described as multiplication, if it satisfies the following conditions. 


1. If M, and M, are elements of the group, then their product M,M; is also a member of 
the group. 


2. Multiplication of group elements is associative, such that M,(M iM, 4) = (MM pM, p 


3. The group must contain an identity element, labeled J, such that IM, = M¡I = M; for 
every M, € G. 


4. For every M, € G, there exists an inverse element M7 l € G such that M¡'M, = 
MM! =1. 


The number of elements in a group is called its order. Groups may be finite, countably 
infinite, or continuous in order. The present section develops some of the general properties 
of finite groups but many will apply, with obvious generalizations, to infinite groups also. 

Note that “multiplication” means composition of successive transformations, which 
is not necessarily multiplication in the conventional sense. For example, the set of inte- 
gers constitutes a countably-infinite group with respect to addition: the set is closed with 
respect to addition (even though the number of elements is infinite); addition of integers is 
associative; the identity element is O because 0 + x = x + 0 = x; and for each x the inverse 
—x belongs to the group. Similarly, the integers (0, 1, ...,m-— 1} constitute a finite group of 
order m under addition modulo m. Ordinary multiplication of integers, on the other hand, 
does not constitute a group because there is no inverse element for 0. 
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The structure of a finite group can be exhibited by its multiplication table. Consider, 
for example, addition of integers modulo three. There are then three elements that we will 
label abstractly by J = 0, a = 1, b = 2 such that ab = Mod[a + b, 3] = 0 = I. The complete 
“multiplication” table is shown in Table 10.1. 


Table 10.1. Group multiplication table for cyclic group of order 3. 


Notice that each element occurs exactly once in every row and every column and that 
each element has an inverse. One often omits the labels from a group multiplication table 
because they simply repeat the first row and column when the identity element is placed 
in the upper left interior corner. This particular group has order 3 and is identified as C}, 
the cyclic group of order 3. Observe that each row or column is a cyclic permutation of 
the group elements. Later we will prove that any group of prime order is cyclic. Two 
elements commute if MM; = M,M;. A group is described as abelian if its multiplication 
law is commutative, such ‘that M, M; = MM), for any i, j and as nonabelian otherwise. 
The group multiplication table for an abelian group is symmetric about its diagonal. The 
addition of integers is abelian, but most groups are nonabelian. 

Two groups that can be arranged to have the same multiplication table are isomorphic. 
For example, the symmetric group S, consists of all permutations of n distinguishable 
objects. Thus, S, is of order 6 and its elements 


P, 2 3114, B, C}] = (A, B, C} (10.2) 
P, 3 2114, B, C} = (4, C, B} (10.3) 
P, ¡ 3[{4, B, CJ] = {B, A, C} (10.4) 
P, 3,114, B, C}] = (B, C, A} (10.5) 
P; ¡ [{A, B, C} = (C, A, B} (10.6) 
P; 2 ¡ 114, B, C}] = {C, B, A} (10.7) 


can be enumerated by considering their transformation of a generic set of three objects, 
denoted (A, B, C}. Alternatively, there are six symmetry operations that leave an equilateral 
triangle invariant, consisting of rotations through 120°, 240°, 360°, and three reflections 
about bisectors. The entire set of symmetries can be constructed from the identity ele- 
ment J, the basic 120° counterclockwise rotation R, and the reflection P (for parity) across 
the vertical bisector. The elements of these two groups can be placed in a one-to-one cor- 
respondence that makes their multiplication tables identical. Therefore, these groups are 
isomorphic and their formal properties are identical. In Fig. 10.1 we compare the symme- 
tries of an equilateral triangle with the elements of S} and assign generic labels for use in 
composing the multiplication Table 10.2. 

The permutation notation may be transparent, but the group properties would be the 
same whatever the context in which $} emerges. Therefore, it is customary to employ 
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Table 10.2. Isomorphism between symmetries of an equilateral triangle and permutations of three 
objects (vertices). 


Generic Triangle Permutation Order 


I I [123] 1 
a R [312] 3 
b R? [231] 3 
E P [132] 2 
d PR [321] 2 
e PR? [213] 2 


Figure 10.1. The symmetries of an equilateral triangle include the identity 7, 120° counterclockwise 
rotation R, reflection across the vertical bisector P, and the combinations R?, PR, and PR?. 


more abstract labels, such as (1, a, b, c, d, e), and to display the group multiplication table 
in the form in Table 10.3. Notice that this group is nonabelian. Some find assembly of 
the multiplication table easier using permutations and others using geometrical reasoning. 
(Make sure that you can reproduce this table using both methods!) 


Table 10.3. Group multiplication table for S,. 


S |I a b c d e 
I I a b c d e 
a a b I e c d 
b b I a d e c 
c c d e I a b 
d d e c b I a 
e e c d a b I 


Successive applications of the same transformation is represented by a power of the 
group element, such that 
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M} =M, M;', M?=1 (10.8) 


where 7 is the identity element. A transformation is described as order k if each M? is 
distinct for O < n < k. In the case of S}, we find that / is order 1, {P, PR, PR?} are order 2, 
and that {R, R?) are order 3. The powers of an element of order k form a cyclic group of 
order k, defined by 

CC, Ciro G=Q=! (10.9) 
Cyclic groups are abelian. 

Finally, we mention a trivial result that is so useful that it is often called the rearrange- 
ment theorem. Suppose that g is any element of the group G = {G,}. The product gG = 
{(gG;,), i = 1, n} contains every element of G exactly once but probably in a new order. 
Thus, any complete sum over functions of a group element can then be rearranged by mul- 
tiplying the argument by an arbitrary group element according to 


X, fish] = $, fih] (10.10) 


heG heG 


without affecting its value. 


10.2.2 Equivalence Classes 


An element b is described as conjugate to element a if there exists an element u € G 
such that uau”! = b. In other contexts this relationship might be described as similarity, 
so we will employ the same symbol a ~ b. Obviously, every element is conjugate to 
itself, a ~ a, and this relationship is commutative, a ~ b => b ~ a. It is also transitive: 
a ~ b,b ~ c = a ~ c. An equivalence class consists of the set of all elements that are 
conjugate to each other. The identity element forms a class unto itself and each element of 
an abelian group also constitutes its own class; hence, the notion of classes is most useful 
for nonabelian groups. Note that all elements of a class must have the same order. If we 
construct b” using 


b = uau"! = b" = (uau"')---(uau"!) = ua"u! (10.11) 


we find ak = I = bk = 1. 

Let us return to $}. Recognizing that there are three elements of order 2, consisting of 
fc, d, e} or {P, PR, PRA), we wonder whether they constitute an equivalence class. By direct 
calculation using Table 10.3, we find 


aca! = acb = eb = d => c ~ d (10.12) 
beb! = bea = ca = d => e ~ d (10.13) 


and use transitivity to conclude that c ~ d ~ e forms an equivalence class. Similarly, a ~ b 
or R ~ R? also form a class. 
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10.2.3 Subgroups 


A subset H of the elements of G that forms a group with respect to the same composition 
law is described as a subgroup of G. This relationship is denoted using the notation of sets, 
H c G. Every group G contains two trivial subgroups, one consisting of just the identity 
element {7} and the other consisting of G itself; these are described as improper. One 
of the main problems of group theory is the construction and classification of all proper 
subgroups of a specified group. The relevant criteria are that H be closed with respect to 
multiplication and that inverses are present for each element; the associative property is 
satisfied automatically and every subgroup must contain the identity element. 

If K is a subgroup of H and H is a subgroup of G, then K is a subgroup of G. Because 
every group must contain its subgroups, one can construct the sequence G > H > K::- > 1. 
We now derive Lagrange’s divisor theorem, which states that the order of a subgroup 
must be a divisor of the order of its parent. This theorem is obviously true for improper 
subgroups containing either one or all elements. Suppose that H is a proper subgroup of 
G and label its elements (H,, i = 1, h} where A is the order of H and H, = J is the identity 
element. Consider an element a € G that does not belong to H and construct the left coset 
aH = {aH,,i = 1, h} containing the products of a with each H,. This set is not a group 
because it does not contain the identity element. Each element of aH is distinct and none 
are included in H. (Note that aH; = H; => a = H ¡Hr € H, contrary to the assumption 
that a 4 H.) If the union of H and aH, both of order h, is not G itself, we select another 
element b and construct a new left coset bH of order h that has no overlap with either H or 
aH. (If bH, = aH, then b = aH ,H;' € aH because HH € H.) Thus, we can divide G 
into a subgroup H and its cosets according to 


G=H+aH +bH +. (10.14) 


where each subset contains / elements. If g is the order of G and A is the order of H, then 
g = mh where m is the index of the subgroup and equal to the number of independent 
cosets plus one. Therefore, h is a divisor of g, as stipulated by Lagrange’s theorem. 

Once again consider $}, where 3 and 2 are divisors of its order. There is one subgroup 
of order 3, namely (/, a, b}, that is isomorphic to the group of invariant proper rotations of 
an equilateral triangle, and there are three subgroups of order 2, namely {Z, c}, {Z, d}, (1, e) 
based upon reflections across a bisector. 

If a e Gis an element of order k, we can construct a cyclic group (1, a, a,..., ac!) 
where a‘ = I. This cyclic group, called the period of a, is the smallest subgroup of G that 
contains a and its order must be a divisor of the order of G. Therefore, all groups with 
prime order must be cyclic and can be generated from one of its elements other than the 
identity. 

Although our derivation of the divisor theorem employed left cosets aH, we could just 
as easily have chosen right cosets Ha. Left and right cosets need not be the same, but the 
argument still works. In the special case that aH = Ha = H = aHa"! for any a e G, we 
describe H as an invariant subgroup of G. Invariant subgroups clearly must contain one 
or more complete equivalence classes. More generally, if H is a subgroup of G, and a is 
any element of G, then aHa”! is also a subgroup of G described as conjugate (or similar) 
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to H. Thus, invariant subgroups are self-conjugate with respect to any element of its parent 
group. A group that contains no proper invariant subgroups is described as simple. Cyclic 
groups are simple by that criterion. 

Invariant subgroups have some very important properties. Consider the multiplication 
of two cosets, aH and bH, of an invariant group H. Such a product is defined as the set 
of all products of pairs of elements from the two sets, taking care not to assume that 
multiplication commutes. We find 


(aH)(bH) = a(Hb)H = a(bH)H = (ab)\(HH) = (ab)H (10.15) 


where HH = H because H must be closed to form a subgroup. Thus, the product of two 
cosets of H is itself a coset of H. In fact, we can identify the cosets constructed from an 
invariant subgroup H as the elements of a new factor group, denoted F = G/H, such that 


F ={H, g,H, gH,...} (10.16) 


where the g, are elements of G that are not contained in H. (If g, were a member of H, the 
coset would simply be a re-ordering of H itself.) Proving that F actually is a group is an 
instructive exercise. First, we test for closure. Assuming that both g, and g, belong to G 
but not to H, we find that the product 


FF; = (g,H)\(g,H) = (g8 )H => FF, € F (10.17) 


is indeed an element of F because the product g;g; belongs to G but not H. Second, we 
verify that the associative property 


EEE) = g;H((8 4D) = (6D DjgH = (EEE, (10.18) 
is simply inherited from G. Third, we identify H as the identity element of F because 


HF, = H(g,H) = ¿(8h = g;H = F, (10.19) 
FH =(8B)H = g;H =F, (10.20) 


using associativity and H? = H. Finally, we verify that F contains an inverse 
F =D" =ER*=¿HH = gg; HH" =I (10.21) 


for every element because the inverses for each g; belong to G and not H while H™! is the 
list of inverses for each member of H, all of which belong to H. Therefore, F is indeed a 


group. 


10.2.4 Homomorphism 


Consider a mapping G > G’ where a in G is mapped onto the image a’ in G’. A mapping is 
described as homomorphic when: (1) each element of group G is mapped onto an element 
of G’; (2) the mapping preserves multiplication rules such that ab = c => a'b = c'; 
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and (3) some of the mappings are several-to-one. As a trivial example, suppose that all 
elements of the cyclic group C, are mapped onto G’ = {I} containing only the identity 
element. G’ obviously meets the requirements of a group and the multiplication rule for C 
is preserved by the mapping because the C, = C,C; > 1. Therefore, this mapping satisfies 
the requirements of homomorphism. 

Suppose that G > G’ is a homomorphism. The image of the identity element / in G 
must be the identity element /’ in G’, such that J > T. If any element g} > J’ is mapped 
onto 7’, then gi. > I’ is also needed to preserve the multiplication rules. Furthermore, 
if g, > T then all members of its class must also be mapped onto /’. Thus, all elements 
H = (81, 82 ---, 8m} of G that are mapped onto /’ constitute an invariant subgroup of G. 
All elements of the coset aH, where a € H, are mapped onto the same image a > a’ + I’ 
in G’. Therefore, G’ is isomorphic to the factor group G/H. 


10.2.5 Direct Products 


A group G = H, ® H, 9 ::: @ H, is described as a direct product of its subgroups if: 
(1) elements in different subgroups commute; and (2) each g € G is uniquely expressible 
in the form g = h,h,:-:h, where h; € H,. These conditions require that each H, be an 
invariant subgroup and that these subgroups, described as direct factors, share only the 
identity element. 

Similarly, the direct product G € G’ of two different groups is formed by taking all 
pairs (g; g,), where g; € G and g’, € G’, and evaluating the products 


(8o 8 (Se 81) = (88r 8/81) (10.22) 


It is then a simple matter to verify that G € G’ satisfies all the requirements of a group and 
that its order is the product of the orders of its factors. 


10.3 Representations 
10.3.1 Definitions 
Suppose that 


v= Y um 6= 6m (10.23) 
i=1 i=l 


are vectors in an n-dimensional linear vector space where the basis vectors {u;} are ortho- 
normal with respect to the scalar product 


(WI) = y Vib, (uj lu) =6;; (10.24) 
i=1 


and suppose that L is a linear operator satisfying 


La + bb) = aLy + bLé (10.25) 
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where a, b are complex numbers. The complex coefficients {w;} or {@;} are described as 
coordinates with respect to basis u and are obtained using 


Y, = (uly) (10.26) 


Further suppose that T is a linear transformation that effects an invertible one-to-one map- 
ping between old basis vectors u; and new basis vectors v;, such that 


y, = Tu, = > Tu (10.27) 


Such a transformation is described as unitary if it preserves scalar products, whereby 


(TYIT =l) = TT =1 (10.28) 
for any Y, ¢. Let 
y =Ty =D y Ta =D by) = > Vy; (10.29) 
i=l ¿ij=1 j=l 


represent the vector y in terms of the new basis v, whereby 
v= WT, (10.30) 
i=l 


It is important to recognize that the coordinates and the basis vectors transform differ- 
ently under T — notice the ordering of the indices. These transformations are sometimes 
described as cogredient for basis vectors and contragedient for coordinates. The vector y 
is physically the same whatever basis (or coordinate system) we choose. Therefore, matrix 
elements of the linear operator L should be independent of basis, whereby 


WIL16)=WIT'TLT'T |¢)=WIL'1¢’) (10.31) 


shows that matrix representations of operators in the two bases are related by the unitary 
similarity transformation 


L’ = TLT? (10.32) 


An operator L is described as invariant with respect to the transformation T if TLT? = 
L. Assuming that T is unitary, the condition TLT~! = L implies that L and T commute, 
such that LT — TL = 0. Thus, it is useful to define the commutator of two operators by 


[L T]=LT -TL (10.33) 


The symmetry group for L consists of the set of all unitary transformations that leave L 
invariant. 
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The effect of a symmetry operation R upon the basis vectors is described by a unitary 
matrix D[R] defined by 


n 


Ru, = )\u,D; ARI (10.34) 


j=1 


where the ordering of indices is designed to preserve the correspondence between matrix 
representations and group elements. Thus, successive application of transformations gives 


n n n 


R,Ryu; = Ry) DaF Y 4D, [RID A= aR R] (10.35) 
j=l k=1 j=l k=1 
such that 
RR, = R; = DIR, ]D[R)] = D[R3] (10.36) 


Therefore, group multiplication is homomorphic to multiplication of matrices and the cor- 
responding matrices are described as a representation of the group and the underlying basis 
vectors are described as belonging to that representation. Naturally we require the repre- 
sentation to obey the same properties as the group with respect to closure, existence of an 
identity element, associativity, and existence of inverse transformations. Both the symme- 
try group R and the representation D[R] are groups, but the former is defined in abstract 
terms while the latter is a realization that applies to a specific physical system. Even if 
the symmetry of the physical system is only approximate, the analysis of its dynamical 
properties can often be simplified by use of group theoretical techniques. These methods 
also provide insight — sometimes very different physical systems are described by the same 
abstract symmetry group. 

A set of nonsingular square matrices {M, i = 1, Ng} is a representation of the group 
G = {g i = 1, Ng} if 


88 j = 8% = MM) = M, (10.37) 


for all i, j,k < Ng. Clearly any representation is itself a group under matrix multiplication. 
A representation that maps each element of G onto a distinct matrix is isomorphic to G 
and is described as a faithful representation, while homomorphic several-to-one mappings 
onto matrices are described as unfaithful. Every group has a trivial one-dimensional repre- 
sentation with all M, = 1 and a trivial n-dimensional representation with all M, = Z, where 
1, is the n-dimensional unit matrix. Let D[g] denote the matrix associated with element g in 
representation D. Using any nonsingular matrix of the same dimensionality, the similarity 
transformation 


D'[g] = SD[g]S”' (10.38) 


produces a new representation D’ that is equivalent to D. Clearly there are an infinite 
number of equivalent representations of G. We can also form new representations simply 
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by arranging two known representations, D, and D,, in block-diagonal form 


= [Pi 0 
D=D,6D, 0 > (10.39) 
where the diagonal blocks usually have different dimensions and the off-diagonal blocks 
indicate null matrices with the appropriate dimensions. This construction obviously sat- 
isfies the group multiplication rules of G required of a representation and the other prop- 
erties required of a group. Reducible representations can be cast in block-diagonal form 
by a well-chosen similarity transformation and each block is itself a representation of G. 
Any block that cannot be reduced further is described as an irreducible representation. 
Irreducible representations are so important, and the term is used so frequently, that the 
abbreviated neologism irrep finds common usage. The construction of irreps of specified 
dimension is one of the central problems of group theory. 

Before we tackle general theorems perhaps it would help to exhibit a couple of the 
irreps of S}, our prototypical group. A trivial irrep assigns —1 to the reflections {P, PR, PR?) 
and +1 to the rotations (1, R, R?). Direct calculation would show that this representation is 
consistent with the multiplication table for 53, although the representation is unfaithful 
because it uses only two elements to represent six. A faithful two-dimensional irreduc- 
ible representation is provided below. Notice that the determinants are +1 for {Z, a, b} = 
(I, R, R?} and —1 for {c, d, e) = {P, PR, PR?). 


Pa CO) mala A) ras A) 


-1 0 1/1 v3 1( 1 -v3 
el, J al a > y) (10.40) 


If you take the time to spot-check some of the products, you should find that they conform 
with the S, multiplication table. Alternatively, we can use MATHEMATICA to check the 
entire set as follows. We construct a list of matrices and copy the multiplication table into 
a matrix whose elements are representations of the group elements. The function Outer 
forms all pairs of elements of G and uses Dot to multiply the matrices. The final argument 
of Outer instructs Dot to use objects at level 1, a structural device for handling lists of 
matrices. Finally, an equation is formed and simplified to verify that the left- and right- 
hand sides are equal, element by element. It is certainly a lot easier to let a machine perform 
these tedious calculations using instructions formulated at a more conceptual level! 
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_{1 0 _if-1 -V3 af mt. MB 
wp a) Stale ae)? Pale a) 
_f-1 o, ._1f1 V3). afa -v3). 
e[s i ai (i, A eili w) 


G= {i,a,b, c,d, e}; 


i a b c d e 

a b i e c d 

b i a d e c 
S3table = , ; 

c d e i a b 

d e c b i a 

le c d a b i] 


Simpli fy [Outer [Dot,G,G, 1] == S3table] 


True 


10.3.2 Example: Vibrating triangle 


Suppose that, in equilibrium, three equal masses are found at the vertices of an equilateral 
triangle and that those masses are coupled by three identical springs k along the sides of 
the triangle. For small displacements, changes in the lengths of the springs depend upon 
displacements parallel to the sides of the triangle; transverse motions contribute only in 
second order. Thus, we can approximate the energy for small-amplitude motions as 


3 
m : E 
T= 2246 +3?) (10.41) 
yl 2 ((1_ v3 BAY 
22 (xı - x) t 39 + 21 712% + 793 


O] 


The kinetic energy is obviously invariant with respect to permutation of the mass indices 
or any orthogonal transformation of the coordinates. The potential energy can be expressed 
in the form 


k 
V=> 2, tE, (10.43) 


(10.42) 


where € = {x}, Yp X2 Yo X3, Y3} is the coordinate vector and 


5 v3 -4 0 -I -v3 
V3 3 0 Ó <3 <3 
1] -4 0 5 -y3 -1 y3 


a 10.44 
"Alo 0 WB 3 WW 3 es 
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Figure 10.2. Vibrating equilateral triangle. 


is the coupling matrix. We would like to prove that the Hamiltonian for this system is 
invariant with respect to the symmetry group {Z, R, R?, P, PR, PRA), where R denotes rota- 
tion by 120° and P is reflection across the y-axis, by constructing explicit matrix repre- 
sentations for each of these symmetry operations and evaluating the similarity transforms 
GvG"! where G is any of those matrices. 

The basic symmetry operations combine a permutation of the mass indices with either 
reflection or rotation of the coordinates at each vertex. Thus, reflection across the y-axis is 
represented by 


0 p 0 
D[PI=|p 0 0 (10.45) 
0 0 p 
where 
-1 0 
p= ( 0 ) (10.46) 


is a2 X 2 matrix that performs the reflection {x > —x, y > y} at a single vertex. Similarly, 
120° counterclockwise rotation is represented by 


0 0 r 
D[R]=|0 r 0 (10.47) 
r 00 


where 


1. _v3 
a 2 

= f A (10.48) 
2 2 
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rotates the coordinates at a particular vertex. Thus, the reducible six-dimensional repre- 
sentations of the primitive symmetry operators take the form 


0 0-10 0 0 
000 1 0 0 
-10 00 0 0 
Deis 4 6 Ga (10.49) 
0 0 0 0 -1 0 
00000 1 
13 
0 0.0.0 3 =) 
3 1 
0 o o $ @ l 
dd Ne 
DIRI=| 2 7? O en (10.50) 
Ssh Oo $6 9 © 
o @ =). o 0 
3 
o. o Bol a ý 


and the remaining group elements can be constructed from these using matrix multiplica- 
tion. 

At this point we would rather use MATHEMATICA to verify the symmetry relations than 
perform tedious matrix multiplications by hand. Let 


E =dx,, Yis X2> Yo. X3> Ya} ; 


k a 1 V3 1 V3 ? 
rl, x,) [im 2 1)- fixt 2 ») 

all 

Ve 2 ae 2 1i] 


5 V3 -4 0 -1 -4v3 


V3 3 0 o -y3 -3 
wile 0 5 -V3 -1 v3 
4| 0 0 -V¥3 3 V3 -3 |’ 


\-V3 -3 V3 -3 0 6) 


reproduces the potential energy. 
k 
V == 3 E.v.E/ / Simpli fy 


True 
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Next define the primitive symmetry operators 


o o -1 0 0 0 

0 0 0 1 0 0 
pelt 0 0 0 0 | 

o 1 0 0 0 0 

0 0 0 0 -1 0 

lo o o o o a) 

NS O. "O wy ae 

0 t: o 0 Y -i 
ee ae a 

2 2 _ 

o o -¿-B o o 

lo o 5 -i o o) 


and construct the entire group as a list of matrices. 
G = {IdentityMatrix[6],R,R.R,P,P.R, P.R.R}; 


Finally, we check that the potential energy is invariant with respect to a similarity transfor- 
mation using any of the symmetry operations in the group. 


Table [v == Gllil.v. Inverse[CliM, {i, 1, Length [G]}] 


{True, True, True, True, True, True} 


Therefore, we do indeed find that v is invariant with respect to G, as expected. Furthermore, 
this G is isomorphic to a reducible representation of S}. Later we will learn how to use the 
properties of $} to determine the eigenvalues of this physical system without solving a 
sixth-order secular equation. 


10.3.3 Orthogonality Theorem 


In this section we prove an orthogonality theorem that is central to the theory of group 
representations. Although we use different notation, the methods are based upon those 
of Tinkham whose proofs, in turn, were based upon those of Wigner. It is necessary to 
develop a few preliminary lemmas first before finally arriving at the grand result. Those 
without the patience to wade through the linear algebra can skip to the finale. 


Unitarity 


First we demonstrate that any representation using matrices with nonvanishing determi- 
nants is equivalent through a similarity transformation to a representation in terms of uni- 
tary matrices. Consider the hermitian matrix 


H = >) DigiDisl" (10.51) 
geG 
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composed of manifestly hermitian terms summed over all group elements. A fundamental 
theorem of linear algebra tells us that any hermitian matrix is diagonalized by the similarity 
transformation 


D = UHU (10.52) 


using a unitary matrix U whose columns are the orthonormal eigenvectors of H and that 
the resulting diagonal matrix D has positive eigenvalues on its diagonal. Thus, 


D = X U"D[glUNU"D[g'U) = $ D'IsiD'Igi (10.53) 
geG geG 


where the matrices 
D'[g] = U”*D[g]U (10.54) 


represent G in the representation for which H is diagonal. Recognizing that D is positive- 
definite, we can construct a useful representation of the unit matrix for this representation 
according to 


DV? = X D”IgID”Ig (10,55) 


2eG 


I= DDD! = p12 » D'[gID’ lel" 
geG 


where D” is the diagonal matrix containing the elements of D raised to power a and where 
the doubly-primed representation 


D"[gl = DD’ [gD (10.56) 


results from a similarity transformation based upon D!””. Finally, we demonstrate that the 
D” [g] matrices are unitary by evaluating the products 


D"[g]D"[g]' = D"[g]1D”[g]' 


= on go Dy D' [A|D’[h]") (10.57) 
heG 


x DUD- D [g D!) 


with the aid of the aforementioned representation of the unit matrix. Note that we must 
use a distinct summation index. Using the commutation properties of diagonal matrices, 
the representation property D[e]D[h] = D[gh], and the hermitian conjugation property 
(D[g]D[h])* = D[AT"D[g]", we find that 


D"[g]D"[g]' = D! > D'[21D0'[2]D'[n]'D'[g]* 
TER (10.58) 


=D" y D'[gh]D'[gh]' = DD =1 
heG 
is indeed unitary. Note that summation of gh over all h is equivalent to summation over 
all g € Gin a different order. Therefore, one can always construct a unitary representation 
using suitable similarity transformations and we henceforth assume, without loss of gen- 
erality, that all representations employed are unitary. 
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Schur’s Lemma 


Next we demonstrate that any matrix that commutes with all elements of an irreducible 
representation must be a multiple of the unit matrix. Suppose that M is a matrix that com- 
mutes with all elements of unitary representation D, such that 


MDIg] = D[g|M = D{g]'M* = M'D{g]" (10.59) 
Multiplying by D[g] on both left and right and using its unitarity, 
M'DIg] = D[g|M" (10.60) 


we find that the hermitian conjugate M* also commutes with every D[g]. Thus, there exist 
hermitian matrices 


H,=M+M', H_=i(M-M”) (10.61) 


that also commute with all D[g]. If we show that any commuting hermitian matrix is con- 
stant, than any M = (H, — iH_)/2 must also be constant. Of course, any hermitian matrix 
can be reduced to the diagonal form D = U~'AU by a similarity transformation, so we 
use the corresponding representation 


D'[g] = U"'D[g]U (10.62) 
for group elements to write the commutation condition as 
DD'[g] = D’[g|D (10.63) 


and must prove that the elements of D are identical. Using the fact that D, ; = D,6; j, each 


LF 
element of this matrix equation takes the form 


(D; - D;) D; jis] = (10.64) 


If D, + D, then all D; ¡[g] = 0 for any g such that the similarity transformation U reduces 
representation D to block diagonal form, contradicting the stipulation that D is irreducible. 
Therefore, if D is irreducible then D; = D j for any i, j is a constant diagonal matrix, 
thereby proving Schur’s lemma. 


Equivalence of Irreducible Representations of Common Dimension 


The next intermediate result takes as much care to state as to prove. Suppose that D® is an 
irreducible representation of dimension n, and DY is another irreducible representation of 
dimension n,. Further, suppose that M is a rectangular matrix of dimensions n, X n, such 
that 


DWI[g]M = MD [g] (10.65) 


for any g € G. Then, n; + n, requires M = 0 while forn, = n, either M = 0 or Det[M] + 0. 
When Det[M] + 0, M is invertible and D and D® are equivalent representations that are 
related by the similarity transformation effected by M. 
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We may again assume that both D® and DO are unitary and that n) < n,. Using the 
fact that the representations are both unitary and irreducible, the hermitian adjoint takes 
the form 


M'D” [g] = DË [g] M" = MD? [g] = D® [gmt (10.66) 
Multiplying both sides by M and using the stipulated commutation property, we find 
MM'DO[g*] = MD? [g !]M" = MMD” [g!] = DU [gum (10.67) 


Therefore, according to Schur’s lemma, MM* must be a multiple of the unit matrix. First 
suppose that n; = n, such that M is a square matrix and MM* = cI where c is a number. 
Then 


Det| MM] = Det[M] Det[M]* = |Det[M]P = c (10.68) 


requires that c be nonnegative. If M + 0, then Det[M] + O and M is invertible. Next 
suppose that n) < n, and consider the n, X n, matrix N formed by appending n, — nı 
columns of zeros to M. Matrix multiplication 


Ny ny 
(NN"), = DUNN = Y MMi = (MM), , (10.69) 
k=1 k=1 


shows that NN? = MM? because the appended columns do not contribute. However, 
Det[N] = 0 by construction, such that [Det[M]?P = |Det[N]? = 0 = c = 0. There- 
fore, ifn, + m, then M = 0. 


Orthogonality Theorem 


We are finally ready to derive the orthogonality theorem for irreducible representations. 
Let D® and D represent irreducible representations of group G of order Ng in terms 
of nonsingular matrices of dimension n, and n, that are either identical or inequivalent; 
equivalent representations of the same dimensions are excluded. Then 


Y DO [gh yy gl = Neo, apt yy (10.70) 
geG + 
where the summation runs over all group elements. This theorem will play a central role in 
determining the number of inequivalent irreducible representations that exist with specified 
dimensions. 

Consider 


M= > D[g|XD® [s] (10.71) 
geG 
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where X is an arbitrary matrix and both M and X have dimensions n , x n;. We can demon- 
strate that M satisfies the condition of the third lemma as follows. 


DWIg]M = X DO [gh]XD® [17] 
heG 


= DO [gnIxDO [h'g] 
hee (10.72) 


= y DfalXD® [a] DO[g] 
aeG 
= MDO[g] 
Any particular matrix element takes the form 
= G) (i) al 
My, = >, >, Dap lelXp DP, [| (10.73) 
BEG P',y 


First suppose that n, + n; => M = 0. Recognizing that the matrix elements Xy „ are 


arbitrary, we are fice to choose Xg y = = 0g p Ooi such that 
0= y De ¿pLelDO, | [e] (10.74) 
geG 


The unitarity of D then requires 


i * Q) 
0 = $ DO [gl*Daslgl (10.75) 
geG 
forn, +n, 


F 
Next suppose that n; = n,, such that either M = 0 or D® ~ DY. The former shows 


that inequivalent representations are orthogonal. Different equivalent representations are 
excluded by the conditions of the theorem, leaving us only to deduce the normalization of 
the sum over products of matrix elements for the same representation. Again using the fact 
that M is a multiple of the unit matrix gives 


2, DO TDO y = Oxy (10.76) 


geG YK 


for any X. Making an inspired choice 


=d gia = A Poel Doslgl = côp y (10.77) 
geG 


we can use the unitarity of D to perform the sum over the n, identical diagonal matrix 
elements to obtain 


br N, 
Y) DO [gl Doli = cn; = Nodg, => ¢ = ~ 265, (10.78) 


H geG : 
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Therefore, we finally obtain the orthogonality theorem 


i) pe nW) N, 
Y DË, Ls! Dele] = 25,5554; co) 
geG + 


An important special case is that the normalization of matrix elements 


Y |D0,1e]]? = “a (10.80) 
geG t 


summed over group elements is independent of the indices of the matrices. 


10.3.4 Character 
Simple Characters 


In the face of an unlimited number of equivalent irreducible representations that can be 
produced by similarity transformations, it is desirable to describe irreps in terms of invari- 
ant properties. An obvious candidate is the trace of the matrices, which is invariant under 
similarity transformations. Furthermore, because all members of a class are related by 
similarity transformations, the trace of representation matrices must be the same for every 
member of a class. Therefore, the character x'?[C,] of class C, in representation DY’ is 
defined as Tr[ DY [s]| where g € C,. The characters for irreducible representations are 
described as simple while those for reducible representations are composite. In this sec- 
tion we use the orthogonality theorem to develop several valuable relationships for simple 
characters and in the next we consider composite characters. 

Specializing the orthogonality theorem for inequivalent irreducible representations to 
diagonal elements 


da N, 
Y DOIT Dg, I8] = —26; ¡0 i 


geG l 


and summing over indices, we find 


Dll xls] = Nob, oe 
geG 
or 
y N, x le] yO [sA] = NGô;; (10.83) 
k 


where N, is the number of elements in class C,. Evidently, simple characters also satisfy 
an orthogonality theorem. When the two irreps are the same, we obtain a formula 


Y LO tal = YM, [x el =No (10.84) 
g k 


that severely constrains the total number of inequivalent irreps that exist for a finite group. 
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For example, we already know that $} contains three classes: (1), {R, R2), and {P, PR, P, 
R?). The character table for S, is given in Table 10.4. Each column is labeled by class 
and the number of elements in that class. The rows list the characters for representative 
matrices in each irreducible representation. Because the identity element is in its own 
class, the first column also gives the dimensionality of each irrep. The first row is the 
trivial irrep in which each element is represented by the integer 1. For S} there is another 
one-dimensional representation in terms of parity. These rows are orthogonal, weighted by 
class size, and satisfy the normalization condition. No other one-dimensional irreps can 
exist. Using the orthogonality conditions, Eq. (10.83), a two-dimensional representation 
must satisfy the equations 


2420 43 = (10.85) 
i435 <3. = (10.86) 


Thus, we find that there is one two-dimensional irrep and no higher-dimensional irreps 
are possible because the system of orthogonality equations would be overdetermined. We 
also observe that these results are consistent with the normalization condition. Therefore, 
these simple considerations are sufficient to construct the character table without actually 
producing representation matrices. This exercise suggests that the number of irreducible 
representations is equal to the number of classes. This is a general result, but we will forgo 
the formal proof. 


Table 10.4. Character table for $}. 
151] Ci 2C, 30, 


DY 1 1 1 
po 1 1 -1 
DO 2 =] 0 


Suppose that we define a square matrix 

Mo = y) [c] (10.87) 
from the body of the character table. The inverse of this matrix is found using the orthog- 
onality formula 

Mz! = Ly [c,| = (Mm), => N o cre [c= (10.88) 

ii NX il => Ep TA ki X k] = 95 . 
k 

where the summation ranges over classes. However, because a matrix commutes with its 
inverse, we can also express orthogonality in the form 


N 


(mM), | = 2 nx e CT = 6, (10.89) 


where the summation ranges over irreps. 
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Therefore, orthogonality of character 


EN [CT 2” Cc = Nob T 
F wje le _ Nas 10.91 
PERE es (10.91) 
m i 


can be expressed in terms of either class or irrep and both can be useful in construction of 
character tables. Here we chose Greek indices for irreps and Latin indices for class. 


Composite Character 


Recall that a reducible representation can be cast in block diagonal form with a suitable 
similarity transformation. Upon reducing such a representation as fully as possible, such 
that each block is irreducible, we write 


Dig] => a,D” Ig] (10.92) 
H 


where the summation ranges over irreducible representations and the nonnegative integers 
a, count the number of times each irrep D™ appears in D. The same irrep may appear 
in several equivalent forms within D and a, counts each equivalent form equally. The 
summation is schematic — it enumerates the structure along the diagonal of the block diag- 


onalized D. The trace of D then becomes 


xia = > a,x Ia] = x[Ci] = > 4,x [0] (10.93) 


u u 


where C; is the class that contains g. Thus, the character for each class in a reducible 
representation is the sum over the simple characters for irreducible representations of the 
same class weighted by the number of appearances of equivalent forms of those irreps. 
The orthogonality theorems for simple characters facilitate projection of a, 


y Nx" [e] x[e] = X y a NiX” [C x [C,] = Nea, (10.94) 
i ip 
whereby 
1 1 
= D Nx” [eC] x[C] = 7 D xP lslxlel (10.95) 
a, Ng 2 x [el x[c,] Ne Š sl'vlg 


Suppose that D = DO is irreducible, such that a, = Oey Then 
2 
DSN [ell = Ne (10.96) 


is consistent with previous results for the character normalization summed over classes of 
a particular irrep. In fact, this condition provides a test for irreducibility. For an arbitrary 
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representation, one finds 
2 
DNAxleil = Ne (10.97) 


with equality if and only if y is simple and D is irreducible. Expanding y again gives 


DN Dix [Cie [Ci] = No >, laf (10.98) 
m 


i mv 


such that 
1 
No 2,5 Lele. ]P= 2, la, =1 (10.99) 


If computation of the quantity on the left gives 1, then the representation is irreducible. 
If it happens to be 2, then D contains two different irreps, with each occurring just once, 
because the a,, are nonnegative integers. If it happens to be 4, then D contains either one 
irrep occurring twice or four different irreps all occurring once. 


Dimensionality Theorem 


Table 10.5. Multiplication table for S, with elements arranged to place / on the main diagonal. 


S; I a b c d e 
I I a b c d e 
a'|b Ia d e c 
b! |a b I e c d 
Etle d e Ia b 
dd e c b I a 
alle c d a b Í 


The regular representation of any finite group is constructed by arranging the group 
elements to place the identity element along the main diagonal of the group multiplication 
table. The matrices D“*9[g] are then obtained by replacing g in the multiplication table 
by 1 and all other elements by 0. Thus, if we label columns by g; we can label rows by g;!. 
We illustrate this construction using S} again. Using the multiplication table in the form 
shown in Table 10.5, we obtain the matrices 
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1000 0 0 
0 10 0 0 0 
0 0 1 0 0 0 
DH] = 00010 0! (10.100) 
000 0 1 0 
000 0 0 1 
0 10 0 0 0 
0 0 1 0 0 0 
100 0 0 0 
Dla] = 0000 1 01° (10.101) 
000 0 0 1 
00 0 1 0 0 
00 1 0 0 0 
100 0 0 0 
0 10 0 0 0 
D[b] = 00000 41) (10.102) 
00 0 1 0 0 
000 0 1 0 
0 0 0 1 0 0 
0000 0 1 
000 0 1 0 
Dic] = 10000 ol (10.103) 
0 0 1 0 0 0 
0 10 0 0 0 
000 0 1 0 
00 0 1 0 0 
000 0 0 1 
Dld] = 01000 0P (10.104) 
100 0 0 0 
0 0 1 0 0 0 
000 0 0 1 
000 0 1 0 
000 1 0 0 
Dle] = 00100 0 (10.105) 
0 10 0 0 0 
100 0 0 0 


and can easily verify that these obey the group multiplication table. 

The character of the identity element of the regular representation is N¿ while the 
characters of all other classes are zero by construction. Furthermore, the simple character 
XU] = N, for the identity element of an irreducible representation is just the dimension- 
ality of irrep u. Application of the projection theorem, Eq. (10.95), 
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1 1 
(reg) — u) * (reg) = (reg) — 
ae = Ng > N, x” [018 [0] = m =a 9 =N, (10.106) 


then shows that the number of times each irrep appears in the regular representation is 
equal to its dimensionality. Finally, the composite character of the identity element of the 
regular representation 


er] = X ax” = No = yim (10.107) 
H H 


provides a simple relationship between the dimensionality of irreps and the order of the 
group. This result is general because neither of these quantities depends upon the repre- 
sentation; the fact that we obtained it with the aid of the regular representation does not 
limit its generality. Therefore, we obtain the dimensionality theorem 


SINT =N (10.108) 
H 


For example, the fact that Ng = 6 for Sy is sufficient to deduce that S} supports exactly 
two different one-dimensional irreps and one two-dimensional irrep because 1? + 1? + 
2? = 6 is the only solution to the dimensionality equation in terms of positive integers Nu 
Furthermore, the regular representation contains both one-dimensional irreps once and the 
two-dimensional irrep twice. 


10.3.5 Example: Character table for symmetries of a square 


In the preceding few subsections we derived many abstract theorems and we need a non- 
trivial example to illustrate their use. Consider the symmetries of a square, excluding oper- 
ations that twist its sides. These symmetries are among the permutations of the four ver- 
tices and can be labeled as 


I = [1234] (10.109) 
R = [2341] (10.110) 
R? = [3412] (10.111) 
R? = [4123] (10.112) 
P = [4321] (10.113) 
PR = [1432] (10.114) 
PR? = [2143] (10.115) 
PR? = [3214] (10.116) 


where R indicates a rotation (cyclic permutation) and P a reflection across the x-axis (parity 
operation). The combinations of reflection and rotations then fill out the list of permissible 
permutations. Table 10.6 provides the group multiplication table with an extra column for 
the order of each element. This table shows that multiplication is associative, closed, and 
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Table 10.6. Group multiplication table for symmetries of a square. 


Square | / R R? R P PR PR? PR Order 
Í I R R? R? P PR PR PR 1 
R R R? R? I PR P PR PR 4 
R? R? R? I R PR PR P PR 2 
RÈ RÈ I R R? PR PR PR P 4 
P P PR PR PR I R R R 2 
PR PR PR PR P RÈ I PR R 2 
PR? PR PR P PR R R I R 2 
PR? PR P PR PR R R R I 2 
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that there is an inverse for each element. Therefore, this set of operations forms a group of 
order 8 and, because each element is a member of S,, we conclude that the symmetries of 
a square are a subgroup of S,. 


The calculations below establish class membership. 


PRE! = PRP = PPR = RÈ >R ~ R? 


RPR! = PRR = PR = P ~ PR? 


RPRR™! = PR = PR ~ PR? 


Thus, the five classes consist of 


C =} 

C= {RR} 
C= {R} 

C, = [P, PR?) 
C, = (PR, PR?) 


(10.117) 
(10.118) 
(10.119) 


(10.120) 
(10.121) 
(10.122) 
(10.123) 
(10.124) 


and there are also five irreps with dimensions d = {1, 1, 1, 1, 2}, the only solution to d-d = 8 
among positive integers. Placing the trivial one-dimensional representation D® [g] = 7 on 
the first row and the dimensions d on the first column gives us the start of a character 
table (10.3.5). We can also use the parity of the permutations as a second one-dimensional 
representation giving row 2. By inspection we verify that these rows are orthogonal and 
normalized properly when weighted by class size. 


Table 10.7. First steps in construction of the character table for symmetries of a square. 


x 16. 2 G 2G, 20; 
Do/i I Ii 1 1 
D?};1 1 1 -1 -l 
po | 1 
D® |1 
DƏ | 2 


Next, we identify A = (1, R?) as an invariant subgroup by examining the multiplication 


table and observing that A commutes with all other elements of G. Thus, its factor group is 
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F = G/A = {A, RA, PA, PRA} and an irrep of F must be an irrep of G by homomorphism. 
The multiplication table for F is given in Table 10.8. 


Table 10.8. 


Since F is abelian each element is in its own class, such that F supports four one- 
dimensional irreps. Every element other than the identity (A) is order 2, such that irreps 
must assign them values of +1. Thus, the irreps are {1, 1, 1, 1}, {1, 1, -1, -1}, (1, -L 1, -1}, 
and (1, —1, —1, 1}; it is a simple exercise to verify that these representations obey the mul- 
tiplication table for F. We can now use the homomorphism with G to complete rows three 
and four of its character table (Tab. 10.3.5). It is comforting to observe that the orthonor- 
mality conditions are satisfied at this stage. 


Table 10.9. 

YG, 26- C 20, 20; 
DO ]1 1 1 l 1 
DƏ |1 1 1 -1 -l 
DƏ |1 -1 1 1 -1 
Deli: << Y =i- 4 
D® | 2 


Finally, we use orthogonality relations to produce a system of four linear equations for 
the four remaining elements of this table. 


DEDO tie Ed +e? = (10.125) 
2428 +8 - 28” A =0 (10.126) 
DL +48 + 28” — 28” =0 (10.127) 
DA ap ay aye =o (10.128) 


After some straightforward algebra, we obtain row five and complete the character table, 
as shown in Table 10.10. You should verify, for practice, that both the row and column 
orthonormality relations are satisfied for this character table. The reasoning employed here 
is typical of that used to construct character tables. There are variations in the sequence 
of steps, of course, depending upon what information is available. Sometimes one can 
dispense with construction of factor groups; other times more than one is both available 
and helpful. 
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Table 10.10. Completed character table for symmetries of a square. 


y 16 +. € 30 26 
DOI I I I l 
DƏ |i 1 1 -1 -l 
DƏ |1 =f 1 1 1 
pe |1 -1 1 -1 4 
DƏ|2 0 2 0 0 


Next we construct a reducible dimension-four representation using the permutation 
matrices for four objects; examples from each class are listed below. 


1000 
Di1, 2,3,41 =|) » i o=% =4 (10.129) 
0001 
010 0 
mae : z oo (10.130) 
1000 
0010 
maaf a Jones (10.131) 
010 0 
0001 
maaan : i oo (10.132) 
1000 
0010 
maaf x o os (10.133) 
0001 
Then 
Wg Datel = (1x4 +20 + 1x04 2x0" 42x27) =3 (10.134) 


shows that this representation contains three irreps, once each (1? + 1? + 1? = 3). Thus, D 
contains the two-dimensional irrep plus two of the one-dimensional irreps. We can deter- 
mine which one-dimensional representations are within D using 
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1 “1 

a= g 2AM Xe = gL X4 42x04 1x042x0+2x2) = 1 (10.135) 
k 
1 | 

=D Xe = gx 4+2x0+1x0-2x0-2x2)=0 (10.136) 
G k 
1 + 1 

=D am Xe = GLX 4-2x0+1x0+2x0-2x2)=0 (10.137) 
G k 
1 “1 

MM GU 204 1x0 2x042x2) =1 (10.138) 
k 

such that 

D=D” D” D Sy, =1+M + (10.139) 


is also satisfied. If one has suitable reducible representations at hand, relationships of this 
type can sometimes be used to compute the character table, perhaps instead of using factor 
groups. 


10.3.6 Example: Vibrational eigenvalues of square 


Suppose that, at equilibrium, four equal masses m are found at the vertices of a square and 
that these masses are connected by four equal springs k along its sides and two springs 
kk along the diagonals. We consider small-amplitude vibrations and expand the potential 
energy to second order in the spatial coordinates. Let é = {x,, Yi X2, Ya, X3» Y3 X4 Y4} rep- 
resent a state vector for the system where the masses are labeled sequentially. The kinetic 
and potential energies are given by 


1 3 

T= z” E & (10.140) 
1 

V= z% > Vi 88; 


2 
¿[e a +ba eel + (x3 = 14) + (y 34) +2 = =) 


2 vV 
amr] 


where the symmetric matrix 


(10.141) 
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2+K —K —2 0 =K K 0 0 
-K 2+K (0) 0 K =K (0) —2 
-2 0 2+K K 0 0 =K =K 
11 0 0 K 2+K 0 -2 —K —K 
a x x 0 0 Bee x 2 0 i 
K =K (0) —2 -K 2+K 0 0 
0 0 =K =K —2 0 2+K K 
0 -2 =K =K 0 0 K 2+K 


describes small-amplitude vibrations. (Please check this matrix!) The equations of motion 
are then 


. OW 
mé, = = a> vë; (10.143) 


and normal modes satisfy eigenvalue equations of the form 
é= A Exp| —iw, 1] => wove, = we, (10.144) 


where wo = k/m. Note that we use Greek indices to label the eigenvectors and Latin 
indices to label components. In this section we will demonstrate that one can deduce the 
eigenvalues using properties of the symmetry group for this system without solving a sec- 
ular equation. 

We begin by constructing representation matrices for typical members of each class. 
Consider first the parity operation 


0.0.0 p 
DIP =| o a (10.145) 
pooo 
where 
1 0 
p=(5 a (10.146) 


inverts the y coordinates at each vertex and D[P] reflects vertices across the horizontal 
midplane of the system. Similarly, the basic 90° rotation of vertices is represented by 


0 r 00 
0 0 r O 
D[R] = 000 r (10.147) 
r 00 0 
where 
0 1 
r= le o) (10.148) 


operates on the coordinates at each vertex. You should verify, by explicit calculation, that 
the potential energy is invariant with respect to every member of the symmetry group for 
the square, such that v = D[g]vD[g!]. 
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The transformation matrices for this representation are obviously reducible. To deter- 
mine their irrep content, we evaluate the composite characters for each class 


C, => =8 (10.149) 
C, = {R, RÈ} = x, =0 (10.150) 
O, = [RP] =% =0 (10.151) 
C, = {P, PR} = x, =0 (10.152) 
Cs = (PR, PR*} =% = 0 (10.153) 

to obtain 
Dla =D mlxlc]f=8 (10.154) 
Y NG 


Thus, this representation turns out to be the regular representation 

D = D” @ D? @ D® @ D® 92D” (10.155) 
Upon application of the similarity transformation that diagonalizes v, we expect to find 

y = Diag[(A,, Az, Ag Àg Ag 1 Às p Aso, À5s,2}] (10.156) 


where À; , and A; are the eigenvalues for the two occurrences of DO. 
Recognizing that traces are invariant under similarity transformations, we expand 


Tr Digly = Ay VP [g] + 4, XP Lg] + As HOLS] + ALVES] + (As, + As) VL] (10.157) 


and use the character table to deduce five linear equations for the eigenvalues 


Tr D[]v = 8+ 4k =A, +A, +d; +Aq+2(As +59) (10.158) 
Tr D[R]v = 0 =A, +A, = à; = àg (10.159) 
TrD[R?] v = 4« =A, +A, +A, +A, —2(A5; +25) (10.160) 
Tr D[P]v = 4 =A, -A, +A, -Ay (10.161) 
Tr D[PRlv = -4K =A, -A,-A, +A, (10.162) 


for our particular representation. Combining the first and third equations gives 


(4g; + A52) =2 (10.163) 
such that 

A, +A, +à; +2, =4+4x (10.164) 

A, +A, - A; -AL=0 (10.165) 

A, -4A, +43-Ay=4 (10.166) 


A, -À -d +2, =-4k (10.167) 
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From the first pair we deduce 
A, +A, =A, +12, = 242k (10.168) 


and from the second pair 


A, —A, =2-2k (10.169) 

A, -—Ag=24+2k (10.170) 
Thus, we find 

A, =2, A, =2k, A, =24+2kK, 1A¿=0, (As; + Aes) =2 (10.171) 


Finally, we require an argument that distinguishes between the two occurrences of DC). 
On physical grounds we expect to find three modes with A = 0 that correspond to either 
rigid rotation or to horizontal and vertical displacements of the center of mass. The null 
eigenvalue for D® can be identified with the rotational mode because this representation 
is one-dimensional, leaving one of the two-dimensional representations to describe trans- 
lations of the center of mass. Therefore, we conclude that the set of eigenvalues for this 
system is 


A = (2, 2x, 2 + 2x, 0, 2, 2, 0, 0} (10.172) 


where we account for degeneracy and choose, arbitrarily, to place the rigid translations 
last. 


10.3.7 Direct-Product Representations 


Suppose that DW[R] and D [R] are two representations of the same symmetry opera- 
tor R that act upon different coordinates and that operators acting on different coordinates 
commute. Let (u,,i = 1, m} and {v j j = 1, n} represent the basis vectors for these rep- 
resentations, such that the m x n-dimensional set of basis vectors for the direct-product 
representation D“®” is given by {u;v ;} with two indices. Thus, 


m n u) w) m n ugy) 
Ruy, = Y) Y Di [Rv Dij IR] = >) Y uv Dui [R] (10.173) 
k=1 l=1 k=1 l=1 
where 
(U@yY) (w) (y 
Dia) [R] = Dei RIDI AR] (10.174) 


are the matrix elements of the direct-product representation. Similarly, the character for a 
direct product 


UR] = TED [A] = Y Dina TR] = Y Di [RIDIR] (10.175) 
EJ ij 


10.3 Representations 401 


is simply the product 
HOR] = yP [R] y” [R] (10.176) 


of the characters of the two representations. Direct-product representations are usually 
reducible as a Clebsch-Gordan expansion 


pve) =- 2A uvly)D” (10,177) 
Y 


where the summation over irreducible representations D is interpreted as ® and where 
the reduction coefficients 


1 
(HV IN= aD xls Isle Isl (10.178) 


G geG 


are nonnegative integers. 
The unitary matrices D“®” can be reduced to the block diagonal form 


po» 0 0 0 


0 = 0 0 
aa 
CD“! = a. di eee a (10.179) 
0 0 0 


where p = (u,v | y) represents, schematically, the number of occurrences of D” in the 
Clebsch-Gordan expansion and where the pattern is replicated for all irreps y. The unitary 
matrix C is the matrix of column vectors 


ma EN 


ij 


kyl 


i ‘Vw (10.180) 


whose Clebsch-Gordan coefficients are arranged in a symbol based upon bra-ket notation. 
Note that specification of the irreps generally requires two labels because there may be 
multiple occurrences. Using the orthonormality of the bases and the unitarity of C, one 
obtains the orthogonality relations 


Vo |e Vi vin dy 
>| K i ae j Eb (10.181) 
ij 
OA tie (10.182) 
i j| k k li j ith jj 
WOK 
such that 
y , * 
un =D ps Wyk (10.183) 
Yok J 
where we define 
A v yly é 
a i Ae j al (10.184) 


for the inverse coupling according to the conventions for bra-ket notation. 
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A product representation is described as simply reducible when ( u, v | y) is either 0 or 
1 for all y and when each g`! is in the same class as g. The index ¢ is then superfluous and 
one can determine the CG coefficients, up to a phase, relatively simply. If we invert the 
similarity transformation 


(ugy) (u) v) 4 
Dy ijlg] = Dr ilg1Dy,¡lgl = 2; (i 


Erd 
i k 
yk KkK J 


k i j 


7) OP [g] l ” 


7) (10.185) 


and then multiply by D” [g]*, sum over group elements, and apply the orthogonality the- 
orem for irreps, we obtain 

YY pp Y 

kit j 


where N¿ is the number of group elements and n, is the dimensionality of DY. Taking 
ť =i, j’ = j,k’ = k will provide at least one nonzero CG coefficient and we are free to 
choose it to be positive. Varying i’, j’, k’ will then provide enough information to determine 
the remaining CG coefficients. You are probably familiar with the Clebsch—Gordan (CG) 
coefficients for coupling of two angular momenta, but we are using similar notation in a 
more generic sense applicable to direct-product representations of any group; naturally the 
CG coefficients depend upon the particular group involved. For many groups it is possible 
to construct bases that make the Clebsch-Gordan coefficients real. 


(v) 102) y 


w) o N, 
> Dy ilglDy ¡lg1Dg klg] = at , 
n, W j 
geG Y 


y * 
4 (10.186) 


Example 


Let us return once more to the problem of small-amplitude vibrations of an equilateral 
triangle. The six-dimensional representation we employed is actually a direct product of 
a three-dimensional representation of S} describing the permutations of the mass indices 
and a two-dimensional representation of a group containing rotations and reflections that is 
isomorphic to 53. The two-dimensional representation is irreducible, but the three-dimen- 
sional representation is not. Suppose that we arrange the group elements as (1, R, R?, P, PR, 
PR?). As worked out in a previous section, there are two one-dimensional irreps with char- 
acters y® = {1, 1, 1, 1,1, 1) and y? = (1,1, 1, —1, —1, —1} and a two-dimenional irrep 
with character x? = (2, —1, —1, 0, 0, 0}. A faithful three-dimensional representation con- 
sists of simple permutations of the three mass indices, and a little thought, or explicit 
construction, will reveal that its character is xO = {3, 0, 0, 1, 1, 1}. Hence, the character of 
the product representation is xo = {6, 0, 0, 0, 0, 0). The reduction coefficients are 


1 
6,211) = 6 FE 0,0, 1, 1, 1} » {2, —1, —1, O, O, O} » {1, 1, 1, 1,1,1} = 1 (10.187) 


1 
3,211) = = 9113,0,0,1, 1, 1) (2,1, -1,0,0,0} (1,1, 1,1, -1-1) = 1 
(10.188) 


1 
(3,2 | 2) = 6 2o 0, 0, 1, 1, 1} * (2, —1, —1, 0, 0, O} » (2, —1, —1, 0,0,0} = 1 (10.189) 
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where this notation indicates multiplication of corresponding elements of each list and 
summation over the list of products. Thus, the Clebsch-Gordan expansion takes the form 
DE% = DY @ DO) @ DË. 


10.3.8 Eigenfunctions 


The Hamiltonian H commutes with every operator in its symmetry group G, such that 
|A, G| = 0. Let j denote the labels for an irreducible representation DY of G; then 
the matrix representation of H will commute with each irreducible DW[R], such that 
|H, Di [R]] = 0 for all R € G. According to Schur’s lemma, the matrix representation 
of H within the group space of DY) must be a multiple of the unit matrix. In other words, 
there must be a set of n linearly independent eigenfunctions {W), m = 1, n} with the same 
eigenvalue Ej» such that 


HYD = E yy? (10.190) 


for 1 < m < n,. This set of eigenfunctions is described as a multiplet that is degenerate 
with respect to H with degeneracy n,. If we start with one eigenfunction in a multiplet, the 
others may be generated by applying symmetry operations because, according to group 


closure, RW) produces the linear combination 


m'=1 


where the coefficients DP m [R] form a nonsingular n ¡dimensional matrix representation 
of operator R within the set of eigenfunctions belonging to eigenvalue £,. Thus, if H and 
R commute, sets of eigenfunctions that are degenerate with respect to H transform among 
themselves under R. If we now enlarge the basis to include all possible values of j, H is 
represented by a diagonal matrix with the set of e; along the diagonal, each appearing n, 
times, while representations of R take the fully reduced block-diagonal form 


DOR] 0 0 
D[R]=| 0  D®[R] 0 (10.192) 
0 O° 


where the superscripts enumerate £;. Each WD is described as belonging to row m of 
irrep j. 

The energy eigenvalues £, for each irrep j are usually distinct but occasionally one 
finds two or more distinct irreps that are degenerate in energy. These situations generally 
indicate the existence of an accidental or dynamical symmetry and not all eigenfunctions 
with the same energy are connected by the symmetry operators within G. Knowing the 
degeneracy n, of an energy level is not always sufficient to determine the dimensionality 
n; of the irreps at that energy; one must also verify that the corresponding D[R] are irre- 
ducible. Testing for irreduciblity can be done using character tables for G. On the other 


404 10 Group Theory 


hand, accidental degeneracies are often broken by either perturbations or by subtle effects 
neglected by simpler models. A famous example is the Lamb shift in which the acciden- 
tal degeneracy between the 2s and 2p,,, states of the hydrogen atom is broken by the 
fluctuations of the electromagnetic field in vacuum that are predicted by quantum electro- 
dynamics. 

Assuming that our system does not possess accidental symmetries, the orthogonality 
theorem for irreducible representations provides important selection rules. The unitarity of 
R requires 


GI] QO. 0) (0) 


Win? |W) = Rin? [RUD = Da [DA ye RIA RA] (10.193) 
AN 
and averaging over the group gives 
(us ) | YO) = Y (y (i | We) = Pe cal [R]* DÝR R] = y Dwr 0% Pl) i 
AN Jo hat 


(10.194) 


where n , is the dimensionality of irrep j. Thus, eigenfunctions belonging to different irreps 
or to different rows of the same irrep are orthogonal. Next, suppose that V is invariant with 
respect to R, such that [V, G] = 0, for any R € G, where G is the symmetry group of H. 
Then 


(ur |v |W) = (uw [RVR] YD) = Y (Wx? |V |u) Dv wIRIDemER] (10.195) 
AN 
can again be averaged over R to obtain 


ly >| vw) = “if mn y dv |v |v’) (10.196) 


de 


Observe that these matrix elements vanish unless j = j’, m = m and are independent of m 
for specified j. Therefore, we can write 

[V, G1 = 0 = (Um |V |W) = V5, Sp (10.197) 
where V; depends only upon the irrep to which these states belong. This is also a conse- 


quence of Schur’s lemma, but the foregoing derivation provides some practice using the 
orthogonality theorem. 


10.3.9 Wigner—Eckart Theorem 


Although most operators of interest will not be invariant with respect to the symmetry 
group of a Hamiltonian, it is often possible to express them in terms of operators that trans- 
form among themselves according to an irreducible representation of that group. Consider 
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an operator VY? that transforms with irrep j according to 


RVG Rt = 2 VOD? u[R (10.198) 


for any R e G. We assume that DY®) is simply reducible. Using the orthogonality theorem 
to average matrix elements of the form 


(y m [Vat | wi?) = (y m | RRVA RR] yi) 


m m 


A IA) En) 0) (10.199) 
= Y (We? | vie | Wi) DX [R] Di RID mL RI 


àX, M’ 


over all R € G then -= 


(vin? [vi v2) = Y (ui? vir u) Dim [RI Di RIA 1] 


Ng AM’ 
wo tela) O e l 


where ny is the dimensionality of irrep J’. Notice that the summation on the right-hand 
side depends upon {j, J, j’} but is independent of {m, M, m’} — the dependencies upon row 
indices for each irrep are contained entirely within the Clebsch—Gordan coefficient, which 
is hopefully known for G. Therefore, we obtain the Wigner—Eckart theorem 


(oi? ee) (o 


m 
where the reduced matrix element (j’\|V 117) is independent of row and can be evaluated 
for the most convenient selection of {m, M, m’}. The reduced matrix element can also be 
obtained using 


UE Y [aryl y, 


i UM 

but that method offers no obvious economy of effort. Note that additional phase or normal- 
ization factors are sometimes included in the definition of the reduced matrix elements. 

The Wigner—Eckart theorem is usually presented in textbooks on quantum mechan- 
ics in connection with rotational symmetry and angular momentum eigenstates, but the 
group theoretical derivation is more general and applies to any group with simply reduc- 
ible direct-product representations. Nor is it limited to quantum mechanics. Generaliza- 
tions can also be made for direct-product representations that are not simply reducible. 


(10.200) 


vols (10.201) 


N 


4 (10.202) 


10.4 Continuous Groups 
10.4.1 Definitions 


Suppose that R is a finite group containing n elements (R,, a = 1, n}. The group multipli- 
cation table RR, = R, can be described as a discrete function c = ¢[a, b] that produces a 
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unique output index c for each pair of input indices a, b. Functions of group elements can 
be defined on n points by f[a] = f[R,,]. For example, the representation DY? [a] is a group 
function while the character y“[a] is a class function. The set of points is described as the 
group manifold and is discrete for a finite group. 

The elements of a continuous group are labeled by a finite number of continuous 
parameters instead of a discrete index. We assume, for simplicity, that all parameters 
are essential in the sense that there exists no smaller set of parameters, defined as func- 
tions of the original parameters, capable of uniquely identifying every group element. If n 
parameters are essential, the group is described as an n-parameter continuous group. Let 
a = {a,i = 1, n} represent the set of parameters and R[a] represent elements of group R. 
The continuous parameters a; may have either finite or infinite ranges depending upon the 
nature of the group. If the parameters have finite ranges, the group manifold is described as 
compact. We will assume that the group manifold is simply connected but generalizations 
are possible. Group multiplication is represented by the law of composition 


Rla]R[b] = R[c] = c; = ¢;la, b] (10.203) 


where each c; falls within the allowed range for parameter i and where each ¢,[a, b] is a 
continuous function of 2n variables. This requirement is analogous to the closure property 
of finite groups. Similarly, there must exist an identity element a such that 


R[a | R[b] = RIDIR [a®] = R[D] (10.204) 
and every element a must have an inverse a such that 
R [ā] Ria] = Rla]R [a] = R[a | (10.205) 


It is usually possible to define the group such that all parameters of the identity element 


A . (0) A . 
vanish; in other words {ai = 0;i = 1, n}. Henceforth we will assume, unless stated other- 
wise, that the identity element is at the origin of the parameter space. Finally, the associa- 
tive property 


Rlal(R[b]R[c]) = (Rla]RIDDR[c] = dla, ¢lb, cl] = d[ dla, b], c] (10.206) 


is usually the most restrictive. 
We will concentrate on Lie groups which perform transformations 


olle k= onl) = x = £l[x;a] (10.207) 


upon an n-dimensional vector space that are described by n functions of m essential para- 
meters that are analytic in both x and a. For example, consider the two-parameter group of 
nonsingular linear coordinate transformations on a line according to 


x =Rlalx = x = a +(1+a,)x (10.208) 
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where —co < a, < oo and a, + —1. The identity element is (a, a,) = (0, 0} by construction. 
Group multiplication is closed according to 


R[a]R[b]x = Rial», +(1+ b,)x) 
= ay + (1 + a) (bo + (1+b1)x) (10.209) 


= (a, + by (1 + a) + (1 +4 +b,)x 


such that 
ola, b] = ay + by (1 + a) (10.210) 
fp, la, b] = a, +b, (1 +a,) (10.211) 


are continuous functions within the specified (infinite) ranges. Each element a = (a,, a,} 
has an inverse element a = (a,, @,} where 


== (10.212) 

0_— 1 +a, ` 

ay =-> > (10.213) 
a 


are analytic functions because a, + —1; furthermore, a, + —1 is in the proper range. The 
associative property requires 


olola, b], c] = dla, olb, cl] (10.214) 


Although it might seem obvious that linear transformations are associative, perhaps it is 
instructive to verify this property explicitly. Expansion of the left-hand side gives 


do [fao + bo (1 +a), a, +b, (1 + a,)), {co c,)] 
= ay +bp(1+a,)+cp(1 + a, +b, (1 +a,)) (10.215) 
= ag + bo + Co + aby + aycy + biCo + abico 

9, [fao + ba (1 +a), a, +b, (1 +a, )}, {co c,)] 
=a, +b (14a) +e,(1 +a; +b, (1 +a,)) (10.216) 
=q; +b, +c; +a,b, +a,c,+b,c, +ajb,c, 

while expansion of the right-hand side gives 

Po [fao a,), {by + co (1 +b,),b, +c, (1 +b,))] 
= ay + (by + co (1 + b,))(1 +a) (10.217) 
= ag + bo + Co + aibo + a¡cy + by Co + a,b, Cy 

9 [(a, a,), {by + co (1 +b,),b, +c, (1 +b,))] 
=a; + (0, +c (1 +b,))(1 +a) (10.218) 


=a,+b, +c; +a,b, + a,c, + bici + a,byc, 


408 10 Group Theory 


Thus, these ¢, do satisfy the associative property and R satisfies all the requirements of a 
continuous group. Notice that this group is nonabelian. 

The requirement that every transformation be invertible means that one can always 
solve for x; in terms of {xi}, which requires that the Jacobian 


Ox, Ox, 

Fas : :1%0 (10.219) 
Es SEn 
Ox, e Ox 


be nonsingular. The completeness condition requires successive transformations 


x = lx; a] (10.220) 
x” = é |x';b] (10.221) 


to correspond to 

x” = E[x;c] (10.222) 
where c is an allowed parameter set, calculable in terms of analytic functions 

c, = mla; b] (10.223) 


of the two constituent sets of transformation parameters. 


10.4.2 Transformation of Functions 


Suppose that new coordinates x’ are obtained by applying group element R to the old 
coordinates x. This transformation of variables is represented schematically by 


x’ = Rx (10.224) 


even if R is not simply a matrix. Thus, R is described as an operator that acts on the 
coordinate variables. We wish to construct an operator R that acts on a function f[x] to 
produce a new function f[x”] that has the same values for the transformed coordinates, 
such that 


x = Rx > f |x] = Rf = f [Rx] (10.225) 


It is important to understand that a forward transformation of a function is produced by an 
inverse transformation of its arguments. Perhaps the simplest example is rotation within 
a plane. Think of the function f[x], where x = {x}, X2}, as a surface. Positive rotation of 
that surface relative to fixed coordinate axes is equivalent to negative rotation of those 
axes relative to a fixed surface. For example, the surface f[x, y], shown on the left side of 
Fig. 10.3, has a positive ridge in quadrant 4 with x > 0, y < 0. Positive rotation of the 
surface by 90° relative to fixed coordinate axes yields the surface f[x, y] with its positive 
lobe in quadrant 1. Alternatively, rotation of the coordinate axes relative to a fixed surface, 
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fixy] 


Figure 10.3. Positive rotation of a surface is equivalent to negative rotation of its coordinates. 


such that x > —y, y > x, relabels the quadrant 4 as quadrant 1 and produces the same 
transformation. We describe R as an active transformation of the function and R”! as a 
passive transformation of the coordinate system. 

In the next section we will develop a general technique for constructing R given R, but 
now we preview this method by analyzing infinitesimal rotations. Suppose that x = {x,, X2} 
represents the original coordinates and x’ = R[e]x = (xi, x2} represents coordinates after 
an infinitesimal rotation 


Rae ( 7] R= ( i i (10.226) 
€ 1 -e 1 
A first-order Taylor expansion 
Of 1, $, of of 
f[Ro'x| = fix] - ax, (x, - x,) - Bn (x5 — x,) = f] + e( 50 aes (10.227) 


relates functions at new and old coordinates where negative signs multiply the derivatives 
because the argument R”!x involves the inverse of R. Therefore, the operator that trans- 
forms from f[x] to Fix] is 


0 ð 
R=1 = 10.228 

+ of ax, Ži =| ( ) 
for infinitesimal e. Notice that if we employ polar coordinates with x = {r Cos[6], r Sin[6]}, 


infinitesimal rotation of a function is performed by the differential operator 


ð 
R=1-e 10.229 
esp ( ) 


10.4.3 Generators 


The continuity of Lie groups permits a finite transformation to be performed in a sequence 
of smaller steps. Before confronting the general technique, consider the specific example 


410 10 Group Theory 


of coordinate translations. Let T[a] be a translation whose effect upon coordinates is given 
by 


Tlajx=x+a (10.230) 
Translations form an abelian group 
T[b]T [a]x = T[b](x + a) = x+ a + b = T[b]T [a] = T[a]T[b] =T[a+b] (10.231) 


with identity element T[0] and inverses T~'[a] = T[-a]. The associative property is also 
satisfied. We seek an operator 7 [a] that performs a corresponding translation of an analytic 
function 


Tlalflx] = f [To 'lala| = fix- a] (10.232) 
Any finite translation can be performed in a sequence of infinitesimal steps 
Tial = jim (7|$]) (10.233) 


where an infinitesimal translation is evaluated using the definition of derivative as 


dflx] 
dx 


d 
Tlelflx] = fix- £] = fl - Se = Tle] = (1 - e£) sta (10.234) 


Thus, a finite translation takes the form 
dy d 
Ta] = lim (1 = £) = Exp| a (10.235) 
ndx x 


where a function g[A] of an operator A is evaluated by substituting that operator into the 
Maclaurin series for g[x]. Although this operator is really little more than a fancy repre- 
sentation for the Taylor expansion 


fi-a] = Y) A h-a) 100 = Expl -a| 00 (10.236) 


co 
n=0 


it serves as a prototype for the definition and behavior of generators of Lie groups. 
We identify p, = —i0/0x as the generator of infinitesimal translations along the x-axis, 
such that arbitrary group elements can be expressed in the form 


Tla] = Exp|-iap,| (10.237) 


Translations with respect to the x-axis are obviously a subgroup of translations within 
three-dimensional space and we can generalize to three dimensions 


T [a] = Exp|-a-¥] = 7/3] s P] = 1 [7 -al (10.238) 
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and obtain an operator representation of the three-dimensional Taylor expansion. In gen- 
eral, an n-parameter Lie group G is described by n generators 


G,= (= | (10.239) 
=0 


i 0a; A 
of infinitesimal transformations. Notice that 
[a] = G[-a] (10.240) 


applies to any infinitesimal generator. 
More generally, if x is an m-dimensional coordinate vector and a is an n-dimensional 
group vector whose identity element is a = 0, the Taylor expansion 


Tlalx = ¿lx a] => (T[-alx), = £;lx; -a] ~ x; - sy a, (52) (10.241) 
izl iJa= 
can be used to write 
f|Ti-a]x] = flx] - »> “a (72) (10.242) 
j=l i=l a= 
or 
Tlalflx] ~ í = o aa] fix (10.243) 
= 
where the infinitesimal generators 
de le | a (10.244) 


are linear differential operators. Therefore, the operator 


-i) 4G, 


i 


T [a] = Exp[—ia- G] = Exp (10.245) 


performs finite transformations from f[x] to 7f[x] = F[T77'x]. The inclusion of the factor 
of i in the definition of infinitesimal generators ensures that those operators are hermitian 
when the transformation is unitary. Thus, 


T" [a] = T'a] = 7[-a] = Explia- G'] = Explia - G] = Ge G, (10.246) 


for arbitrary real a;. Although many mathematicians do not include this phase, applica- 
tions Of Lie groups to quantum mechanics benefit from this convention because physical 
observables are represented by hermitian operators. We will adopt this convention here, 


412 10 Group Theory 


but the reader must be aware that it is not universal. Finally, notice that the generators 
satisfy a product rule 


G,flxlglx] = glx1G,flx] + fx1G, fx] (10.247) 


where the differential operator G, acts on everything to its right within a term. Care must 
be taken in evaluation of 7 because the generators do not necessarily commute with each 
other. 

Next consider an arbitrary Lie group T with parameters (a,) and infinitesimal genera- 
tors {G,} and assume that a general element can be expressed in a form 


Y a,G; 


1 


T [a] = Exp[—ia - G] = Exp (10.248) 


that is sometimes called a Lie series in analogy with a Taylor series. Closure of the group 
imposes important conditions upon the generators. Suppose that 


T, = Exp| -ie,G,] ~ 1 - ie,G, - 18G? (10.249) 
. . 242 
T, = Exp| -ig,G,] = 1-is;¡G,- 38565 (10.250) 


represent two distinct infinitesimal transformations with inverses 


T;' = Explie,G,] ~ 1 + ie,G, - 4e7G? (10.251) 
T7' = Explie,G,] ~ 1+i8,G, - 656% (10.252) 
such that 


ae A 


272 202 
TTP TT; ~ (: +i(8,G, +8,G,) - 8,2,G/G, - 38G; - a) 


; 242 242 
x (: — i(e,G, + £,G,) - €,8,G,G, - 387G; — 190) 
. 242 ME 
=1- i(e,G; + £/G,) — £;£;G;G; — 36; G= 356; +1 (e,G, + e¡G,) 
22, 22 272 212 
+ EG; + EG +€8/(G,G, + GG) - €,£,G,G, - 387G; - 367G; 
= 1 - 8,8; (6/G, - G,G,) 
(10.253) 


to second order in e; and e,. If this product is to represent an element of T, the commutator 
[G,, G j] = G;G zG j G, must be a linear combination of infinitesimal generators of the form 


: k 
[6,6] =i> 4,6, (10.254) 
k 
where the structure constants E ¡ are real numbers which are fundamental properties of 
the generators that are independent of the choice of representation and are invariant with 
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respect to similarity transformations. The structure constants must be antisymmetric in 
their lower indices, such that 


c= ch, (10.255) 


because the commutator is antisymmetric. Although it should be obvious that the structure 
constants are determined by the generators, Sophus Lie proved the more powerful con- 
verse that groups of this kind are uniquely determined by specification of their structure 
constants. 

Another important relationship among the structure constants uses the double commu- 
tator 


|e. [G,, | = 16,3 Y ¿6 => Gere, (10.256) 
í l,m 
and the Jacobi identity 
|e. [G,, a] + |e, [Ge ail + |o [G,, G | =0 (10.257) 


to find 


O AO Ce. y e a= (10.258) 


l,m l,m l,m 


or 
2,21 agate. +a da, = 0 (10.259) 


Thus, assuming that the G,, are linearly independent, the structure constants must satisfy 
a Jacobi identity of the form 


S (hacks + eich + cea) = 0 (10.260) 
1 


Lie proved that antisymmetry of lower indices and compliance with the Jacobi identity 
are necessary and sufficient conditions for generation of a continuous group from local 
structure constants. 


10.4.4 Example: Linear coordinate transformations in one dimension 


To illustrate these relations, we return to the two-parameter group of linear coordinate 
transformations 


T[x]x = ay +(1+a,)x (10.261) 


such that 


ð 
Tlalflx] = f |T lal] = f [āo + (1 +a,)x] = fix l-a - ax +0" (10.262) 
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where 
A, ae (10.263) 
0 1+ aj 0 
AE Ree (10.264) 
1 1+ aj 1 


for infinitesimal transformations. Thus, we identify the generators as linear differential 
operators 


.0 
Gy = -i= (10.265) 
G, = w (10.266) 
i is Ox š 


The commutator is evaluated using 


ð ( ðf ð \ ðf Of of Pf of 
(GoG, - G,Gp) fix] = - a (x h(x =) eo ae og ae (10.267) 
to obtain 
[Go Gi] = -iGo = 1 =-1, 1 =0 (10.268) 


The structure constants are indeed real numbers and are antisymmetric wrt lower indices. 
The only set of indices for which the Jacobi identity does not vanish immediately is 


1 044 0, 0) 00 o 0 _ 
5 (corre + Cicho + chochi) = 00.10% + koco =0 (10.269) 
7 


such that the identity is indeed satisfied. Therefore, the general transformation operator for 
functions takes the form 


T [a] = Exp - (a, + a,x) =| (10.270) 


10.4.5 Example: SO(2) 


A finite rotation within a plane is performed by the transformation matrix 


_ (Cos[6] —Sin[6] 
rie = (Si pe (10.271) 


and one can (and should!) verify that R[6] satisfies all of the requirements of a group and 
that the group multiplication law 


R[0,] R[6,] =R[O, + 0,] (10.272) 
is abelian. The inverse matrix 
R™'[6] = R[-6] = R"[6] (10.273) 


is the same as the transpose (indicated by superscript T), hence, rotation matrices are 
orthogonal. Also note that Det[R] = 1. Thus, rotations in the plane are isomorphic with 
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the group of all two-dimensional special orthogonal matrices, designated SO(2); the term 
“special” refers to the requirement that the determinant be +1, which excludes reflections. 
An infinitesimal rotation is described by 


l -eé 
e 1 


Rle] = ) =1- ine (10.274) 
where | is the unit matrix, here two-dimensional, and where we identify the generator for 


coordinate transformations as o, where 


0 -i 
On = E o) (10.275) 
happens to be one of the Pauli matrices. A finite rotation is then given by 
; oo., 19 Y 
R[9] = lim R A = lim (1 - o) = Exp[-i605| (10.276) 
noo n n>00 n 


Perhaps it is worthwhile to demonstrate that the original matrix is recovered from the 
power series. Expanding 


fa.) SLI AA S 
Exp| -i90,] = 2 — s LO IN ye A (10.277) 
and using 
a=1=0"=l of =a; (10.278) 
we obtain 
Exp|-i00,] = Cos[9]1 — i Sin[9]o, = R[9] (10.279) 


as claimed. This result is a generalization of the Euler identity to two dimensions. 

Next consider the transformation operator for functions of these coordinates. Let x = 
{xi x,) represent the original coordinates, x’ = R"[elx = {x1, x2} represent coordinates 
after a passive infinitesimal rotation of the axes, and expand 


of 


F of 
Ox, 


, , Of 2 of 
fix] ~ fix] + (x, -n)a 5 (hm) =101+e [ze 3) (10.280) 
to first order. The final term in parentheses is the infinitesimal generator for transformations 
of functions from the old coordinates to the new coordinates. Therefore, the operator that 
transforms from f[x] to Fx] = f[R!x] is 


ð 0 > 
L= -i (« — -%3,.) => f[x] = Exp[-10L1f[x] (10.281) 
OX Ox, 
Notice that the operator L is antisymmetric, like the matrix R, and that when the derivatives 


are interpreted as momentum operators, L is the operator for orbital angular momentum 
perpendicular to this plane. 
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10.4.6 Example: SU(2) 


The states of a spin-3 particle are described by two-component spinors of the form 


_(% 10.282 
I) o (10.282) 


where the amplitudes y, are expressed relative to a particular orientation of the coordinate 
system. Rotations of the coordinate axes mix the components of the spinors, preserving its 
norm, and are described by unitary 2 x 2 matrices, such that 


An Wi) (Uy, Uy 
|v’) uw = (+) iC ale (10.283) 


where UU* = 1. The set of all unitary 2 x 2 matrices with unit determinant forms a group 
that is designated SU(2), which stands for special unitary group in two dimensions. Any 


2 x 2 matrix contains four complex numbers or, equivalently, eight real parameters. The 
requirement of unitarity imposes four constraints and the restriction upon the determinant 
imposes another constraint, leaving three independent real parameters. Thus, an infinitesi- 
mal transformation U can be parametrized in the form 


U= Exp|-ion g (10.284) 


where ñ is an arbitrary unit vector in three dimensions and S is a set of three appropriately- 
chosen 2 x 2 matrices that represent the infinitesimal generators of this group. Unitarity 
requires S to be hermitian and the condition Det[U] = 1 requires Tr[S] = 0. Therefore, we 
express the generators in the form S = G/2 where the Pauli matrices 


0 1 0 -i 1 0 
a=(; o) =(; a): n=l A (10.285) 


constitute a complete set of linearly independent, traceless, hermitian, 2 x 2 matrices and 
where the factor of 5 ensures that the eigenfunctions 


1 1/1 0 1/0 
00, 00 nao 


have eigenvalues +) for S}. Therefore, if we rotate the coordinate axis through angle 
0 about axis fi, the components of a spin-3 wave function transform among themselves 
according to the unitary matrix 


U [| = Exp|-id- èj 2| (10.287) 


where 6 = Gñ encodes the three parameters of SU(2). 

Direct calculation will show that the Pauli matrices satisfy commutation relations of 
the form 

[0; a] = 282, 40% = [So S| = i£,45; (10.288) 


where, by convention, one sums over a repeated index; hence, the right-hand side is 
summed over k. Also, recall that the Levi-Civita symbol Ex jk takes the values +1 for a 
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symmetric permutation of the indices, —1 for an antisymmetric permutation, or 0 when- 
ever two indices are repeated. Thus, the structure constants are antisymmetric and one can 
show that they satisfy the Jacobi identity, as required for a Lie group. Finally, notice that 
the commutation relations for SU(2) can be expressed in the form 


SxS = iS e xò = 20 (10.289) 


where the cross-product does not vanish because the components of @ do not commute 
with each other. One must be careful when generalizing results from vector algebra to 
noncommuting operators or matrices. 

The group SU(2) also admits matrix representations of higher dimension. Suppose 
that we transform coordinates according to x’ = Rx where the matrices R,[6,] describe 
independent rotations about each of three axes. The corresponding infinitesimal generators 
S; for transformations of a vector are then obtained by differentiating R, as follows. 


1 0 0 a 0 0 0 
R¡[0,] =|0  Cosló,] — Sin[6, =s =p!) =-ij0 0 1 
0 -Sin|6,] Cos|o, 1 /6,=0 0 -1 0 
(10.290) 
Cos[0,] 0 -—Sin[o,| 0 0 -1 
R, [0] = ji 0 = 5,=-i( S) =-i/0 0 0 
Sin|9,] 0  Cos{6,| 2 /0,=0 10 0 
(10.291) 
Cos|6,| Sin[@,] 0 jk 0 10 
R, [03] =|-Sin[6¿] Cos[@,] 0 = s,=-i( FF =-i|-1 0 0 
0 0 05 Jo,=0 0 0 0 
(10.292) 


These generators also satisfy the commutation relations of SU(2). More generally, the three 
operators S that describe the intrinsic spin are traceless hermitian matrices of dimension 
2s + 1 that satisfy the SU(2) commutation relations 


[S, S;| = 18,8, (10.293) 
such that 
U p = Exp] -ið l g (10.294) 


generates rotation of the state vectors for particles with spin s. 


10.4.7 Example: SO(3) 


Coordinate rotations that leave the Euclidean norm x1 + 12 + x3 invariant are described 
by orthogonal matrices R[@] where 6 stands for the three parameters needed to specify the 
rotation. Proper rotations that exclude reflections also require Det[R] = +1. Thus, rotations 
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are isomorphic with the group of special orthogonal matrices in three dimensions, namely 
SO(3). Let 


(10.295) 


x, = & [x5 6] = R; | | my 


represent the coordinate transformation. Infinitesimal coordinate rotations are described 
by 


Ex E+OxE = Ex +E, 10%, (10.296) 


where ĝ = [0,, 0,, 03} is an infinitesimal vector and where the summation convention is 
employed. The infinitesimal generators are then 


„ðE; ð oe 
= "00; Ox, = ees Laer D =XXp (10.297) 
where p = ~iV is the momentum operator. These generators can be expressed in either 
cartesian or polar coordinates as 
ô 10) 0 ô 
L, = -ily— -z—] = -i| -Si == tl0l — 10.2 
by > 2) if Sin[¢] 90 Cos[@] Cot[6] 35) (10.298) 
fo ð i ð 4 ð 
L, = —i (= - x£) =-i (cosa — Sin[¢] Cold) (10.299) 
ô 0 ô 
L, =-i = =-i 10.300 
e ile, Ya.) i ( ) 


and are easily seen to satisfy commutation relations 
[Lo L,] = i£; ¿¿L, (10.301) 


that are the same as those of SU(2). Nevertheless, these groups are not identical and we will 
later exploit the homomorphism between SU(2) and SO(3) to analyze their representations 
in considerable detail. For now, suffice it to recognize that the operator that transforms 
functions whose coordinates are rotated through a finite angle is given by 


R lèl = Exp|-i È] (10.302) 


where @ = {6,., 6, 6.) specifies the axis of rotation and the angle and where L are the 


differential operators defined above. 


10.4.8 Total angular momentum 


We can now combine spin and orbital angular momentum by considering the transforma- 
tion properties of a wave function of the form 


y, [x 
¥ [x] = AS (10.303) 
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that consists of 2s + 1 functions of the three-dimensional vector x. Rotation of the coordi- 
nate system will both change the coordinates in each function and mix the components of 
the state vector. Thus, we write 


W [è] = Exp|-ióh - 5] [R [-0811] (10.304) 


where $ mixes the components and R acts upon the coordinates. Expanding the infinitesi- 
mal transformations to first order 


Y [è] ~ (1- ¡ón -S) ¥[1 - oa xa] = (1- ion- S)(¥ [3] - 0 -+x We [x)) (10.305) 


and dropping higher-order terms, we can identify the infinitesimal generators as 


J=L+S (10.306) 
whereby 
OW [x hee Pe a ajra 
‘| Linde [+] = Y [5] = Exp| -i0-3] Y [] (10.307) 


provides the wave function in the rotated coordinate system. 

The spin of an elementary particle is an intrinsic property that should be independent 
of its position or momentum. Therefore, the S operators should be independent of x or 
p and, hence, commute with L; S acts upon a nonclassical internal space represented by 
the 2s + 1 components of the wave function while L acts upon geometrical variables with 
classical interpretations. Thus, the total angular momentum J = L +5 contains orbital and 
spin contributions that satisfy the commutation relations 


[Lo L;] 65,5, (10.308) 
[So S;] = is, 54S (10.309) 
[J J] = is; (10.310) 
[L, 5;] =0 (10.311) 


such that the transformation operators 
R p = Exp|-i 2| = Exp|-i È] Exp|-i6 g (10.312) 


contain two factors that operate on different aspects of the wave function. If the hamilton- 
ian is rotationally invariant, such that [H, J] = 0, we can construct energy eigenfunctions 
that are also eigenfunctions of J? and J,. Often [H, S?] = 0 also indicates that S? is con- 
served, but L?, L,, and $} usually are not when s > 0. 


10.4.9 Transformation of Operators 


Suppose that U is a unitary operator that performs coordinate transformations upon func- 
tions and that A is an arbitrary operator on the same Hilbert space. Operators in the new 
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and old coordinate systems are related by the familiar similarity transformation 
y” = Us = H' = UHU! (10.313) 


If we now represent U = Exp[—iaS] in terms of a hermitian infinitesimal generator S and 
a real parameter a, this similarity transformation can be evaluated as a power series in œ 
with the aid of the Baker—Haussdorff identity 
(ia)? ia) 
2! 3! 
where the number of times S appears in the leading position of each multiple commutator 
is given by the associated power of —ia. Verification of this identity is left as an exercise 
for the reader. For infinitesimal transformations, we find 
( OH’ 
Oa 
An important consequence is that H is invariant with respect to transformations generated 
by operators with which it commutes, such that 


[S,H]=0= H’ =H (10.316) 


eS He = H+(-ia)[S, H]+ 


[S, [S, H]]+ 


[s [S, LS, Hy] + (10.314) 


= -1[S, H] (10.315) 
a=0 


10.4.10 Invariant Functions 


A function y that is unchanged when its coordinates are transformed by any group element 
is described as an invariant of the group and satisfies 


y [Rx] =y[x] = Ry = y = Expl-ia - Gly = y (10.317) 


where a is the parameter vector and G is the corresponding vector of generators. For small 
a; we can use the first-order expansion 


(1 — ia- Gyi] = Vix] => Gy =0 (10.318) 


to conclude that invariants are annihilated by any of the group generators, labeled by 
i = 1,n. For example, consider a central function of the form w[x, y] = wir] where 


r= ¥. x + y? and the infinitesimal generator 
ô ð 
) = -i (10.319) 
Op 
for rotations in the plane. Obviously, w is invariant wrt L, because L_w[r] = 0 for any {x, y). 
If Gy = 0 for some generators but not others, we can describe y as invariant with 
respect to the subgroups generated by each G, that annihilates y. Suppose that y is an 
invariant wrt G, and ¢ is an arbitrary function. Then 


Giph = PG Y + WGib = Y GP (10.320) 
permits the invariant to be factored out as if it were a constant. Similarly, the transformation 
Rw = Exp| —ia,G,] Yo = y Exp|-ia,G,| 6 (10.321) 


only affects ¢, the noninvariant (i.e., variable) factor. 
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A function y[x] that satisfies 
Gy[x] = Alxlw[x] (10.322) 


where A[x] is an invariant of the infinitesimal generator G is described as an eigenfunction 
of G with eigenvalue A[x]. Finite transformations based upon G then take the form 


Exp[-iaG] [x] = e yx] (10.323) 


Suppose that y, and y, are two different eigenfunctions of G with eigenvalues A, and A,, 
respectively. Then 


Gy xl, [x] = 4,Gy, +y Gn = (Ab) + Ai) Yeo Lx] (10.324) 


shows that Y, y, is also an eigenfunction of G whose eigenvalue is A, + A,. 

Let T[a] represent an element of a symmetry group with parameter a that acts on 
coordinate x according to x’ = T[a]x and let 7 [a] represent the corresponding operator 
upon functions, such that 


T lalwix] = 4 [TIal] (10.325) 


Suppose that the Hamiltonian operator H commutes with 7, such that [H, 7] = 0 and that 
(4, i = 1, n} are a linearly independent set of eigenfunctions for H that share a common 
eigenvalue E, such that 


Hy, = Ey; => HT, = ETY, (10.326) 


Thus, 74, is also an eigenfunction of H with the same eigenvalue and can be expanded as 
the linear superposition 


Taly; = y; [TIal] = > y Led lal (10.327) 
j=1 


with coefficients that depend upon the group parameter. The ordering of the indices may 
appear unusual but is needed to ensure that the matrices D[a] constitute a representation 
of the symmetry group 7. Suppose that we combine two symmetry operations 


TID]T lal, = a ;Djilal = > Y,D, [DID ila] (10.328) 
j=l jk=1 
such that 
T[b]T [a] = T[c] = DIb]D[a] = Dic] (10.329) 


Therefore, D[a] is an n-dimensional matrix representation of group element T [a]. Notice 
that this ordering is consistent with the inverse relationship between 7 and T, whereby 


Tlelwlx] = y [T ie] = y [THT aly x] = y [Tal Lola] (10.330) 
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10.5 Lie Algebra 
10.5.1 Definitions 


A Lie algebra is a linear vector space with a law of composition (or multiplication) denoted 
by [A;, A;] that satisfies the following properties: 


e closure under multiplication: if A and B are members of the algebra, then the product 
[A, B] is also a member 


e antisymmetry under multiplication: [B, A] = —[A, B] 
e distributive law: [A + B, C] = [A, C] + [B, C] 
e Jacobi identity: |A, [B, C]] + [B, [C, Al] + [C, IA, B1] = 0. 


This abstract notion includes both Lie groups expressed in terms of differential operators 
that generate infinitesimal coordinate transformations and their representations in terms of 
matrices. The law of composition could be the commutator 


[A, B] = AB — BA (10.331) 


for quantum mechanics or it could be the Poisson bracket 


ðA OB ðB ðA 
A, B] = pa E 10.332 
aa 2 & Op; ðq; 2) i 


H 
for classical mechanics, where q, p; are pairs of canonically conjugate generalized coordi- 
nates and momenta. The fact that general theorems depend upon abstract postulates instead 
of specific realizations gives this branch of mathematics considerable power and versatil- 
ity. 

The dimension of the algebra is the dimension of its underlying vector space or, equiv- 
alently, the number of independent generators. An algebra A is abelian if [A;, A;] = 0 for 
all {A;, Aj) € A. A subalgebra is a subset of the vector space that satisfies all conditions 
for a Lie algebra. An algebra is described as simple if it does not contain an invariant 
subalgebra or semisimple if it does not contain an abelian subalgebra. An algebra is semi- 
simple if and only if it is a direct sum of simple algebras. A compact algebra is necessarily 
semisimple. It is useful to define a metric tensor, also known as a Killing form, as 


bye ee (10.333) 
k,l 


where the proportionality constant can be chosen for convenience. Cartan’s theorem states 
that a Lie algebra is semisimple if and only if Det[g] + 0. Under those conditions we can 
also define an inverse g’/ that satisfies 


Y 218) = 5, (10.334) 
k 


The rank of an algebra is the maximum number of linearly independent mutually com- 
muting operators that it supports. Let {P,,i = 1, r} represent a set of mutually commuting 
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operators, such that [P,, P;] = 0 for all i, j < r and let {Q i j = r + l, d} represent the 
remaining generators. Racah proved that r independent Casimir operators {C,,i = 1, r} 
that commute with all A; € A, such that 


le: P| = [C, 0,] =0 (10.335) 


Y=] 


can be constructed for any semisimple Lie algebra of rank r from linear combinations of 
products of the generators. For example, 


C, « y FAJA, (10.336) 
ij 

commutes with all A; € A; the proportionality constant can be chosen for convenience. 
This lowest-order Casimir operator is usually very useful but higher-order Casimir opera- 
tors are often very complicated. Schur’s lemma tells us that any matrix which commutes 
with all the matrices of an irreducible representation must be a multiple of the unit matrix. 
Therefore, the irreducible representations of A can be labeled with the eigenvalues {c;} 
for the Casimir operators while their members are distinguished by the eigenvalues {p,} 


for the selected (P,). The basis states for such representations can then be expressed in the 
form 


Cyley---¢,3 prep) = Cj [C173 Pr Py) (10.337) 
P; Jeee, Prep = Pi jeee, Prop (10.338) 


Furthermore, raising and lowering operators (ladder operators) for the p; can then be 
constructed from linear combinations of the Q,. 


10.5.2 Example: SU(2) 
For example, SU(2) is defined by the commutation relations 
[So S;] = 18, 545 (10.339) 


and has dimension 3 and rank 1. Its metric tensor 


y is 3 eae (10.340) 
kl kl 


shows that it is semisimple. Because the normalization of g; ; is irrelevant, one normally 
redefines the metric tensor for this group to be g, ; = giz 6; ; There is only one Casimir 
operator 


C= St + 854+ S83 (10.341) 


and it is proportional to the operator for total spin, S*. Therefore, if [H, S] = 0, one may 
classify eigenstates |s, m) by their eigenvalues for 5? and one of the S,, arbitrarily chosen 
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as $}, such that 


S3ls, m) = mls, m) (10.342) 
Furthermore, 
(s, m|S?| s,m) = m? (10.343) 


demonstrates that there must be a maximum value of m for any finite s; equality applies 
only when s = 0. 

To determine the eigenvalues of S? and the allowed values of m for each irrep s, it is 
useful to define the ladder operators 


S, =8,+iS, (10.344) 


whose commutation relations 


[S>, S,] = +S, (10.345) 
[S$,S,]=0 (10.346) 
[S,, S_] = 253 (10.347) 


can (and should!) be verified easily. Applying the first set of commutation relations 
(SS, — S,S3)ls, m) = (S3 — m)S_|s,m) = +8, 15, m) => 538,15, m) = (m + 1)S,,|s, m) 
(10.348) 


demonstrates that SIs, m) is an eigenfunction of 5, with eigenvalue m + 1. Thus, the effect 
of S, is to raise m by one unit while S_ lowers m by one unit. We can express $° in two 
alternative forms 


S =S_S, + S3(S3 + 1) = S,S_ +$3(S; — 1) (10.349) 
and choose to label irrep s using the maximum possible value of m, such that 
Sls, s) = 0 = ls, s) = s(s + Dls, 5) (10.350) 


Closure of irrep s with respect to the ladder operators, S,, requires the allowed values of 
m to form a sequence s, s — 1,..., s — where n is a finite nonnegative integer, such that 


S_|s, s—n) = 0 = Sls, s—n) = (s—n)(s—n-—1)\s, s—n) = (s—n)(s—n-1) = s(s+1) 
(10.351) 


requires n = 2s. Therefore, s is either integral or half-integral such that the dimension of 
irrep s is 2s + | and its eigenvalues are given by 


Ss, m) = s(s + Dis, m), s=0LLo Zi. (10.352) 
Sls, m} = mls, m), m=s,s-—l1,..., -S (10.353) 


It is important to recognize that this analysis was based entirely upon the commutation 
relations of SU(2) and did not employ any specific representation of the associated Lie 
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algebra. Therefore, it applies equally well to representations of any dimension, not just 
the two-dimensional representation that originally motivated the correspondence between 
the generators of SU(2) and the operators for spin. One might expect the same analysis 
also to apply to the representations of SO(3) that describe orbital angular momentum if 
one simply replaces S > L and s > / everywhere. However, that correspondence does not 
exclude half-integral orbital angular momentum, which is actually a tricky issue. For clas- 
sical physics the requirement of single-valuedness excludes half-integral values because 
e*"9 would change sign under @ > ¢ + 27 if m were half-integral, but quantum mechan- 
ics permits this sign change because observables depend upon the absolute magnitude of 
the wave function. On the other hand, because orbital angular momentum is a classical 
concept, we might appeal to the correspondence principle to require single-valuedness 
anyway; no such appeal is possible for intrinsic spin because it is inherently nonclassical. 
For a more rigorous argument that limits / to integer values, we must return to the differ- 
ential operators and their eigenfunctions; that argument is left to the exercises. Perhaps 
we should have anticipated that the algebraic argument based upon commutation relations 
alone would be inconclusive because the generators of SU(2) obey the same commutation 
relations as those of SO(3), yet SU(2) was designed to describe spin-5. Therefore, such an 
analysis does not distinguish between spin and orbital angular momentum. This is actually 
a strength of the method! If we define angular momentum by the requirement that eigen- 
states of a rotationally invariant Hamiltonian transform according to 


R= Exp] -i ; J| (10.354) 


where the infinitesimal generators J=L+S obey the commutation relations 
[Jp 4,] = is, pele (10.355) 


then both spin and orbital contributions are included. 


10.6 Orthogonality Relations for Lie Groups 


Orthogonality relations were crucial to the analysis of finite groups and we expect anal- 
ogous relations to be central to applications using Lie groups, but we must generalize 
from summation to integration using weight functions that preserve group properties. Let 
a = {a,i = 1, m} represent a point in the m-dimensional parameter space of a Lie group 
and let da represent an infinitesimal volume surrounding that point. If b is another point in 
the same parameter space, the group multiplication law c = pla, b] assigns point c to the 
composition of elements a and b. If a is a unique point and the number of elements in the 
volume at bis p[b] db where p[b] is a density function, then the number of points 


plc] de = p[b] db (10.356) 


near c must be the same because group multiplication by a provides a one-to-one corre- 
spondence between elements near c with those near b. Suppose that we choose a to be the 
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identity element, such that 


= $,{0, 6) = db, = Y (242) da, (10.357) 
a=0 


aA da; 
The volume at b is then 
db = J[b] da (10.358) 


where 


J[b] = El dla, 2) | 
da; a=0 


is the Jacobian determinant. Let p[0] denote the density near the identity element, such 
that 


(10.359) 


plo] 


plb] db = pl0]da = plb] = Sig) 


(10.360) 


defines a suitable density function. The density at the origin is usually assigned p[0] = 1, 
somewhat arbitrarily. This version is described as a right density because it is based upon 
the right argument of c = pla, b]. 

Suppose that y[a] is a function of the group parameters. Recall that the rearrangement 
theorem for finite groups states 


Y Mal = vish] =>) Ying] (10.361) 
g g g 


where h belongs to the group and the summations are taken over all group elements g. 
Generalizations to continuous groups then take the form 


f Wlalpla] da = f Y Ola, b]|pglb] db = f dbp, [bW| db, al] (10.362) 


where one must recognize that left and right densities need not be equal if one or more 
parameters has an infinite range. Thus, depending upon application, we may require both 


ono fo) | 
j a=0 

ono) 
da; a=0 


where p[0] may be chosen for convenience. Fortunately, one can show that these densities 


(10.363) 


(10.364) 


are equal for compact groups and we usually will not have to confront this issue. 
Perhaps the most important application of group integration is to the orthogonality 
properties of irreducible representations whose matrix elements are analytic functions of 
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the continuous group parameters. Recall that the orthogonality theorem for two irreps of a 
finite group with Ng elements takes the form 


aran N 
X DO. [gl Daslgl = 8 045809) (10.365) 
geG t 


where n, is the dimensionality of irrep i. This result can now be translated to the continuum 
according to 


i j O. 98 y%.j 
f D? [al Daplalola] da = <42 i pla] da (10.366) 
i n; 
where the integral over density is the analog of Ng. Naturally we assume that the integrals 


exist but will not investigate the necessary conditions; suffice it to say that problems are 
rarely encountered here. Similarly, orthogonality of characters 


Se el xa le] = Netx, (10.367) 
k 
takes the continuum form 


fe [a] y [a] da = ds fetal da (10.368) 


Example 


For example, consider 


x’ = ay +(1+4,)x, a +-1 (10.369) 
such that 
Op Op 
pola, b] = ay + by (1+a,) = Bis =1, aa, = by (10.370) 
Op Op 
¢,[a, b] =a, +b, (1 +a) => an = 0, Ja, =1+b, (10.371) 
results in 
Jb]=1+b, (10.372) 
such that 
b] = | 10.37 
plb] E (10.373) 
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10.7 Quantum Mechanical Representations of the Rotation Group 
10.7.1 Generators and Commutation Relations 


We have seen in previous sections that the wave functions for spin-3 transform among 
themselves according to SU(2) while those for orbital angular momentum are governed by 
SO(3). Both of those groups have dimension 3, rank 1, and satisfy the same commutation 
relations. Therefore, we generalize the definition of angular momentum to include both 
spin and orbital angular momentum by postulating commutation relations of the form 


[Je Fj] = ie; jade (10.374) 


for the generators of rotations of the wave function and define the square of the total 
angular momentum as the Casimir operator 


Pia + = |P,J,]=0 (10.375) 


that commutes with all of the generators. Therefore, we can define basis vectors y ;,, that 
are simultaneous eigenfunctions of J? and one of its components, arbitrarily chosen as J, 
such that 


JY im = MY jm (10.376) 


It is also useful to define raising and lowering operators, also known as ladder operators , 
according to 


ISIE (10.377) 
such that 

DEEA (10.378) 

[7,4] =0 (10.379) 

EA = 2, (10.380) 
Thus, 

(FI, =I I) Vm = FIV > ITV = NDIY (10.381) 

(FI. II) Wim = Ig = TIA im = M- WIV im (10.382) 


demonstrates that J, ;,,, is an eigenfunction of J, with eigenvalue m + 1 — the effect of J, 
is to raise m by one unit whereas J_ lowers m by the same amount. Suppose that u is the 
maximum possible m in the irreducible representation labeled j. Expressing J? in the form 


P=JJ,+I(1+1) (10.383) 
we obtain 


y = Maxim] = JW), =0 = PY), = wet DY, (10.384) 
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and can label the representation by its maximum m such that 
Pb im = JG + Wim (10.385) 


for any allowed m. Application of (Ca to y; yields an eigenfunction Y; ;_,, and must 


terminate at a finite value of n because the representation j is finite. Expressing J? in the 


form 
JP =J,J_+J,(J,-1) (10.386) 
and using J_4; j-n = 0, we obtain 
2 _¢: aps 
PU jon = MG DY; = IG + DY in (10.387) 


where the final step uses the fact that the eigenvalue of J? is independent of m. Thus, we 
find j = n/2 where n is an integer, and conclude that j is either integral or half-integral 
according to whether n is even or odd. The operators J,, J, transform the eigenfunctions of 
J* with common j among themselves and must span the vector space because the represen- 
tation we seek is irreducible. Therefore, the irreducible representation labeled j contains 
2j + 1 basis vectors that satisfy 


PW im = ¡GH Dim (10.388) 
JY jm = UG jm (10.389) 


form = -j,-j+ 1,..., j- 1, j. 

All rotations through the same angle, regardless of axis, are similar and belong to 
the same class. Therefore, there are an infinite number of classes and, correspondingly, an 
infinite number of irreducible representations. To demonstrate this class structure, consider 
a rotation S = R,[6]R,[¢] that creates a new coordinate system with its polar axis in the 
direction {0, 6} wrt to the original coordinate system. If we follow a rotation about this 
new axis by S”! to restore the original coordinate system, the net result of SR, [a]S~! is to 
perform a rotation through angle a about an axis specified by (6, ¢}. This general rotation 
is similar to R, [æ] by construction. Thus, all rotations through the same angle around any 
axis are similar to each other and form a class. The irreducible representations for rotations 
about the z-axis take the form 


DW [R.[0]] = Exp[-im0]1 (10.390) 
with character 


Sin[(27 + 1)0/2] 


Sin[972] (10.391) 


J 
xP 10] = TrExp[-im0]l = > Exp[im0] = 


m=-j 
where 0 labels the class. More generally, an arbitrary rotation is represented by 
DY [w] = Exp[-iw - J] (10.392) 


where w = {6,, 0, 6,}. The D' matrices are unitary, which requires the J; operators to be 
hermitian. 
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Finally, we evaluate the matrix elements of J,. Let 


JV im = Bm jm (10.393) 
JW im Pin jm- (10.394) 


and observe that these operators are hermitian conjugates of each other 
Lal. LA (10.395) 
Furthermore, the cross-products take the form 


JJ_ =P -I (1-1) (10.396) 
JJ =P -J(1,+1) (10.397) 


Thus, the norms 


2 
janl = En Wim 


inl = (byn 


Im) = (Wim [Ite | Vim) = IG + D= mtn + 1) (10.398) 


Jw = (Wim I| Wim) = IG + 1) - mam- 1) (10.399) 


determine the matrix elements up to phase. Choosing the so-called Condon-Shortley con- 
ventions 


FV im = y JG+D=mm+ DY + (10.400) 


IV im = Vidi +1)- mm- DY jm (10.401) 


then determines the relative phases among the y 
J_W;_; = 0, as required by the preceding analysis. 


jm basis vectors. Notice that J; ; = 


10.7.2 Euler Parametrization 


The orientation of a three-dimensional object can be specified by three Euler angles 
describing the following sequence of operations: 


S/S SF OF 


e rotation about the 2-axis through angle «œ, producing new axes X’, Y, 2’ = Z 


SAK S 


e rotation about the }’-axis through angle $, producing new axes X”, 9” = Y, 2 


e rotation about the 2”-axis through angle y. 


Alternatively, the same orientation can be achieved using the following sequence opera- 
tions with respect to space-fixed axes: 


e rotation about the 2-axis through angle y 
e rotation about the }-axis through angle 6 


e rotation about the 2-axis through angle a. 
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Thus, a general rotation matrix takes the form 
Rla, p, y] = R¿la]R, [61R,[y] (10.402) 


with inverses 
(Ria, B, y)? = R,¿[-y1R,[-B1R,[-a] = RI-y, -B, —a] (10.403) 


The ranges for these angles are 0 < œ <271,0<f<r, and O < y < 27. However, a spec- 
ified rotation does not uniquely determine the Euler angles — for example, the parameters 
fa, 0, y) and [a + K, 0, y — x} produce the same rotation for any x. 

Suppose that Y, „ is an eigenfunction with total angular momentum j and magnetic 
quantum number m. The effect of a rotation is then 


j 
Q) 
Ria, B, Yim = Y, Y jmDremlos B, Y] (10.404) 


m =-j 


where DO is the rotation matrix for irreducible representation j. Please note once again 
the ordering of the indices. Having chosen basis functions to be eigenfunctions of J,, we 
can write 


Dla, B, y] = Exp|-iaJ,| 418] Exp| -iyJ,] (10.405) 


where the reduced rotation matrix d' depends only upon £ while the rotations about the 
z-axis are diagonal. Thus, 


j , 
RIO, BW im = >) Vim’ wv'mlB] Exp|-i (my + m’a)| (10.406) 
m=-j 
10.7.3 Homomorphism Between SU(2) and SO(3) 


The groups SO(3) and SU(2) both require three real parameters and their generators satisfy 
the same commutation relations, suggesting that these groups are homomorphic. There are 
two common parametrizations of SU(2). First, consider the Pauli representation 


Po tip, Prt a PES 
U= à . = P 0, + ip: o (10.407) 
pa +ip, Py—ip,) Poor? 


where the parameters p, are real, 


1 0 01 0 -i 1 0 
“= (4 i a =(; J 0 =( a «=(4 o ene 


are the basis matrices, and where the constraint 


p=1-p-p-p (10.409) 
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ensures that Det[U] = +1. Note that there are only three independent parameters and that 
we require |p| < 1. Alternatively, the Cayley—Klein parametrization 
e“ Cosly]  e*Sin[y] 
U[É, y, ] = ies a 10.410 
[6,4 g] le i€ Sin[y] e“ Cos[w] ( ) 
has the advantage that the parameters are unconstrained and U is manifestly unitary. 
Consider the matrix 


ds Z x—ly 
p=T-0 , a ex (10.411) 
from which one can read the components of 7 in a particular coordinate system using 
x= (Pa +p12)/2 (10.412) 
y = (01 —Pja)/ 2 (10.413) 
Z2=P1 (10.414) 


Next, suppose that we evaluate 

p =Upur” (10.415) 
with a similarity transform and evaluate the new coordinates 

7 = Ar (10.416) 


by interpreting the elements of p’ in the same manner. The resulting linear transformation 
is represented by the 3 x 3 matrix A. If A is an orthogonal matrix with determinant +1, it 
can be interpreted as a coordinate rotation that establishes a homomorphism between the 
SU(2) matrix U and the SO(3) matrix A. After some straightforward but tedious algebra 
(see exercises), one finds 


1-2(p3 + p3) 2(pops+PiP2) 2(P1P3- Pop») 
A=|2(p,p)- Pops) 1- 2(pi i p) m + ppp) (10.417) 
2 (PoP + P1P3) 2(P2P3 — PoP:) -2(pi + p2) 


and can verify that AA? = 1 and Det[A] = +1 using Po = lis Pi - Pe - D3. Therefore, 


A € SO(3) is a valid rotation matrix. 

Next we seek the relationship between the Euler angles and the Cayley—Klein parame- 
ters. Although one can evaluate p’ = UpU”! using the Cayley—Klein parametrization in 
general, it is easier to choose the following special cases: 


Zoo Cos[2¢]  Sin[2¿] 0 
ooo. qu) > Yeu = —Sin[2¢] Cos[2¿] 0|=R.[2¿] (10.418) 
1 


0 0 
Cos[24] 0 Sin[2 
wa so (M Salal) L upg = | an 
=y] CO —Sin[24] 0 Coal 


(10.419) 
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Thus, we have expressed the two matrices needed for the Euler parametrization of SO(3) 
in terms of the Cayley—Klein parametrization of SU(2). A general rotation then takes the 
form 


3 o| U [o, 0, 5] (10.420) 


where the + reflects the sign ambiguity in U that arises because R was derived from 
UpU”!, which is second-order in U. This homomorphism is double-valued: two elements 
of SU(2) map onto the same element of SO(3). 

Transformation matrices for state vectors with j = 1/2 are, as usual, based upon the 
inverse of the rotation for coordinates 


(Ria, B, y)? = R I-NR,[-BIR,[-a] 


Ria, $, y] = R.[01R, [BIR [y] = U = +U [o, 0, =] U [o 


10.421) 
dz -lylo -£ 2 ( 
= Us +U 0,0, Jo, z ou foo, >) 
such that 
-i(a+y)/2 Cos A —eile-y)/2 Sin $] 
(1/2) _[e > 2 
D LA b y] = ( eil’-a)/2 Sin[ 5] eilaty)/2 Cos|4| (10.422) 


is a two-dimensional irreducible representation of SO(3) that describes states with angular 
momentum j = 1/2. It is also useful to factor the rotation matrix 


D° fa, B, y] = Exp| -iy 03/2] d°/” [8] Exp|-ia o;/2| (10.423) 


where the reduced rotation matrix is simply 


ame _ [Cos 8 — Sin E 
ena call) ae 
Notice that a rotation of the coordinate system by 27 around any axis simply inverts the 
sign of the wave function; a rotation through 47 is needed to recover the original wave 
function. The half-angles appear here because the homomorphism between SU(2) and 
SO(3) is double-valued. Despite this minor complication, this representation for D“/” 
provides a useful starting point for construction of irreducible representations of higher 
dimension. 


10.7.4 Irreducible Representations of SU(2) 


Consider two variables y, and y_ and construct linear differential operators 


1 ð 0 
= 10.42 
i 0 0 
J,= E 10.426 
eei 00:20 


1 ð ð 
= = 10.427 
J, ¿he xr) (10.427) 
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that act on functions of the form w[y,, x_]. With careful examination of commutators like 


i ð ô ôy ôy 
(I, > iJ, Y = ¿lb e +X, Ov It a oy oh Ov 
- + 


0 ow y ow 
T|X = TA ax, X- ax, X+ Oy 


l ow Y Y Py Py Ow Py 
- ale dy. oy ae Haz “ay, Ae) 
if ôy Pw cn 8Y Ow 
-ix * ay, + AX x_x, TAS ay oy? -Aay 
Py 
MN ay Oy. 
if op 0 
= 3 (eS e) 
= JW 
(10.428) 


one soon verifies that these operators satisfy the commutation relations 
[Fp Fj] = 18,54, (10.429) 


required for angular momentum operators. Next, suppose that y assumes the specific form 


= (2j)! +m m 
eg = a] G+m!(j—-m! are (10.430) 


of homogeneous polynomials of order 27 where m = —j,-j+1,..., j— 1, j and where the 
normalization factor is chosen to ensure that 


j J : 
È Maf = D (2) = (xP + be] = 


m=—j m=—j 


a 


(10.431) 


when the basis vectors are normalized according to | xl +| xl = 1. Then using 


ô ; 
Wim = GY A IL jm = jm (10.432) 


0 . 
X+ ay, im = (J + MW; w a 


we observe that y ,,, is an eigenfunction of J, with eigenvalue m. Similarly, using 


jm 


y ee ra fa ð ð 
Patel eP= 2 2 3 3 
x + y + Z 4 F Oy? EXE Ox? + XX OV, OK. + X4 Ox, + La OY. 


(10.433) 
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such that 


PY m = HG +m)(j +m- 1)+(j-m(j-m-1)+2(j+m(j-m)+3(j +m) 
+3(j = mY m 


= IG + DY jm 
(10.434) 


we find that y”, is an eigenfunction of J? with eigenvalue j(j + 1). Therefore, y, con- 
stitutes a suitable set of basis functions for representations of the rotation group. 
We now require the variables y,, y_ to transform under SU(2) according to pu? [a, B, yl, 


such that 


jm 


172 
, (1/2) 
Xn = Y, Xp Pnim(@ By] (10.435) 
m'=-1/2 


where x, ¡,, is equivalent to y, . It is sufficient to work out the case y = y = 0, for which 


ITA" = (co) - sinf Sh)” (si) + cofle)” 
j > Sr (’ a 4 "| (10.436) 


B j—m+u-v 
5 V,M+V 
2 A 


j+m-u+v 
2j-u- 
| Xy X. 


v=0 
x Cos 5 Sin} 
2 
where the second line arises from the binomial theorem. The terms under the summation 


can be expressed in terms of W;,,, using the substitution 4 + v > j — m such that 


j ; ; 
niem y yy ¡om i-m!—y J+*m J=m 
ey er METE 
m=-j 
m+m!+2v 2j-m-m' -2v 
x cos|5| sin[5| i xm ye (10.437) 


where the summation over y is limited by the binomial coefficients, which vanish when 
the lower argument is negative. Upon insertion of the normalization factors 


j ; , A / 1/2 A A 
Po _y jm -v (j+m')!(j-m’')! jJ+m Jm 
Wim T > Si y G FDG Tm! ) e =m = „J y ) 


m=-j Y 


m+m' +2v 2 j-m-m! -2v 
x Cos|5| sin|5| Wim (10.438) 
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we identify the rotation matrix as 


F ; , A 1/2 . AN 
dentes = (E22 my ie age | ") 


G +m)! = m)! T= =Y 
mm! +2v 2j-m-m' -2v 
x Cos|5| sin|5 | (10.439) 


or 


0) jm! 
dim [6] = Ey 


2 2 
NO j+m NGN calb aoso 
= a j-m -y v 0115] re 


The summation over y is expressible in terms of a hypergeometric function or a Jacobi 
polynomial in Cot[8/2]?, but that does not really simplify the result. One should verify 
that our previous result for 4/2 is contained within this formula. Carrying this one step 
further, one obtains the rotation matrix for spin-1 


yes daa iP lf 


Cos|£]' - Si Sin[£]) (11 +Cos[B) -S4 1a -Cos[B)) 


l v2 
dD =| 2 coja -F |-| =P Coss] - 
Sin[£]' Supl Cos]£]") (301 -Cosis)) 72 51 +Cos[B)) 


(10.441) 


and observes that half-angles are not required for integral j. Wave functions for half- 
integral spin change sign upon rotation through 27, whereas those for integral spin do 
not. 

Below we tabulate some of the symmetries and special values for rotation matrices 
but do not provide proofs. Note that / is restricted to integers while j may be integral 
or half-integral. Be aware that some authors employ different phase conventions for both 
rotation matrices and spherical harmonics. 
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0) 4r 
Dm ola, p, y] = | J41 Y, ml a] (10.442) 


dmolBl = =)” k a = y P, [cos £] (10.443) 

doo[8] = Picos $] (10.444) 

dy mB) = dw [B] (10.445) 

di mln] = (S an (10.446) 
dt ml Bl = CY” door 1B] (10.447) 
dy mB) = O ll = (IB (10.448) 

Dy mla, B, YI" = Dir lo, B, y] = C)” "Dw, -nla B, 71 (10.449) 


10.7.5  Orthogonality Relations for Rotation Matrices 


Now that we have an expression for the reduced rotation matrix, the matrices for arbitrary 
rotations can be evaluated using 


DO [a, B, y] = Exp[-iaJ,] d” [6] Exp|-iyJ.] (10.450) 
= Dy nla, B, y] = Exp|-i(my + m'a)| d’ 1B] (10.451) 


Let w = (a, £, y} denote the Euler angles. According to the orthogonality theorem for 
compact Lie groups, we expect the representation matrices to satisfy an orthogonality 
relation of the form 


i. (Diente) DI lwoo] dea = “imental f plo] dw (10.452) 
2j+1 

where p[w] is an appropriate density function and where the integration includes all group 
elements. It is rather difficult to derive the density function directly because the compo- 
sition law for Euler angles is complicated, but some physical reasoning will provide the 
required function. Consider the motion of the z-axis under the rotations described by y and 
£. The first acts as an azimuthal angle while f acts as a polar angle, describing any point 
on the unit sphere. Thus, we expect a uniform density for y and a Sin[f] density for the 
polar angle. The final rotation by «œ is also azimuthal with a uniform density. Therefore, 
we expect that p[w] = Sin[f] and write 


f (Pentel) DItolpto do 


2 f WW nlBldy'n 18] Expli(n — 1)y] Exp[i(m’ - n’) a] Sin[8] de dB dy (10.453) 


where d(” is real. Orthogonality with respect to magnetic quantum numbers must be pro- 
vided by the integrations over a and y. If j, j’ are either both integral or both half-integral 
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the differences between magnetic quantum numbers are integral and integration over the 
range (0, 27) ensures orthogonality, but if one is integral and the other half-integral, the 
difference in magnetic quantum numbers is half-integral and integration over the range 
(0, 47) is needed for both œ and y. This should not be surprising, at least in retrospect, 
because the identity element for subgroups consisting of rotations about a fixed axis is a 
rotation of 47 for a half-integral spin. Therefore, the covering group capable of accommo- 
dating both integral and half-integral angular momentum features an enlarged parameter 
space with O < a < 4r and O < y < 47. 

Orthogonality with respect to j is then left to integration over £. Treating d\[B] as 
representations of a one-parameter subgroup, we expect 


T D (7) , iu Mo. 
1 din'm [w]din'm [w] Sin[£] df = — f Sin[£] dp (10.454) 
0 27+1 Jo 
or 
T D g’) : 2 
f din’ ml@]din' mw] Sin[B] dB = 2j+1 7 197 (10.455) 


It would be rather difficult to obtain this result directly from the power series for dY 
without using the underlying theorems from group theory. We can now put everything 
together to obtain 


Arr TT Ar . > A 2 
r * 32. 
f da f dp Sin[£] f AD im [a, Bb, yl (Dyin la, Bb, Y) = 5; y Om and n 
0 0 0 ahi eee 
(10.456) 


as the orthogonality theorem for representations of the rotational covering group without 
performing arduous integrations. 


10.7.6 Coupling of Angular Momenta 


Direct products of two irreducible representations of the rotation group produce a Clebsch- 
Gordan expansion 


DiGi) = es | j) DY (10.457) 


J 


Recall that all rotations through the same angle belong to the same class regardless of axis. 
Thus, the easiest way to determine the reduction coefficients is to evaluate the character 


KEH w] = 410110] (10.458) 
for w = {0, 0, 6} to obtain 
ji h 


j 
> y (jija | j) Expl-img] = > y Exp|-i(m, +m) 4] (10.459) 


j m=-j my==j, MA 
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Multiplying both sides by Exp[ing] and integrating over 0 < ¢ < 47 to account for possible 
half-integral quantum numbers, we obtain 


J j 
D (Aol A) Snn = > y Ôm, +m,n (10.460) 


j m=-j m ==), m=-j 
Satisfying these constraints limits j to the range |j, — jal < j < ji + j,. The total number of 
terms on the right-hand side is (27, + 1)(2j, + 1). If we assume, without loss of generality, 
that j; = j,, the total number of terms on the left hand side is 

Jith 

My (2j+ 1) = (2), + 1)(2/ +1) (10.461) 

j=j 

Therefore, every (j, j, |j) in the allowed range equals 1, such that 


Dei) = pliz e plizi!) e... e pita) (10.462) 


shows that the rotation group is simply reducible. 
The direct product of two angular momentum eigenstates can now be expanded in 
terms of irreducible representations according to 


= h h 
W jm, Y jm, a » a My 


jm 


m 


4 Vim (10.463) 


where the Clebsch-Gordan coefficients can be obtained by generalizing the relationship 


a) vin 

Y DHID Igbesi = = mel JE 7) (10.464) 
i Ki jik 

geG My 

for discrete groups to the continuum according to 
An Arr 

1 da f dB Sin[£] f A wo, [oe alo} 

= 27 A A AMA 21) (10.465) 
2j+1\m č m|m\m m,|m 


Notice that for the special cases m’ = j, the rotation matrices have only a single term: 


D (2j)! 1/2 f B j-m B j+m E f 

Dj, m [w] = emre sin) | Cos) | Exp[—i(my + jæ) (10.466) 
(2j)! 1/2 ; B j+m B j-m : i 

Ere] Sin|5 | Cos|5 | Exp[-i(my = jæ] 


De ale = yen 7 
(10.467) 
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Consider the special case mi = Jp m = — Jo» for which 
An An 
(i) (2) (i) x 
f da ia dp Sinl6] f AyD a o Lc atat 


_ 327? ( h J o J 
2 +11, zh |J pm mm 


J ) (10.468) 
m 
and let 
k dl (ir) (i) 
l= da ag males | ayy A [w]D- jam [wD mio)" 


_ cn war (2j,)! Gemini nt)” 
(ji +m)! Um) (jz + m)! (ir - my)! Q +m) -= m)! 


pai j+m j-m 
UE aa tee E 

4r An B Jj M¡+j¿+m,+2j-m=m!-2v 
xf aa f ap sinpi | dy Sin) =| 

0 0 0 2 


B J, +m, + jy—m,+m+m' +2v 
x Cos 5 | 


Exp|- i(m, +m, - m) y] Exp|- ifj =j =m a] 
(10.469) 


where y is an integer whose range is limited by the binomial coefficients in the expansion. 


: ; ies rane i a pee : 
Integration over y requires m = m, + m, while integration over « requires m = j, — jz, 
such that 


j (27,)! (2j,)! 
I= (47), m +m Om! Pes] yin 
Mm, +My MM, jij (i +m,)! (i, - m,)! (ja +m)! (ja -m )! 
¿US Ji +j)! (+j — j,)! y 
(j + m)\(j — m)! 
j j m/2 
tir j+m JM y f dn Sinfi 2h -2-21 
A l Eo y ) i 7 Sin[n] 1 


x Cos[y]*! +2m,+2v+1 
(10.470) 


Using our old friend the beta function (recall Sec. 2.6.2) 


1/2 f E F T 1 
1 Cos[9]?”"! Sin[6]?4"! dé = da ¿Bl», al (10.471) 
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the integral reduces to 


I = 3276 ô peim 


mm +m, mn, j Ja 

«| (2/1)! Ri)! G- Ath) +A =) 
(i, +m)! (ji — m,)! (ip +m)! (ja — my)! (j + m)\(j — m)! 

“rer j+m [Ue a nr 


1/2 


j-ith-v\ y Tits, + +2] 
(10.472) 
and we obtain 
(i h| j e ha a 
h hlkh-=2m m |m 
= Ôm, m, +m Ôm, jij, (2j + DO 
1/2 


x (2/1)! (24)! (hth) +j- i) 


(i; +m,)!(j, -= m; )! (ja +m)! (5, — my)! (j+m)!(j —m)! 
«ror jm pum rm 


Pht BV Y Titi, + i +2] 
(10.473) 
as the product of CG coefficients. Substitution of m, = jį, m, = —j, then gives 
A errno oS 


y Tl2j, +v+1]T[j- ¿+ 3-v+1] 
Titi + +2) 


(10.474) 


The preceding two equations provide enough information to determine the full set of CG 
coefficients for any (m,,m,, m}, but we would prefer to perform the summation above in 
closed form. Here we simply quote the result 


| jh h j 

h ~J2 | Ji 7 Ja 
because evaluation of the sum is a bit tricky and we are not especially interested, at this 
point, in techniques for manipulating such expressions; the details are postponed to the 


exercises. We are free to choose the positive root for this particular CG coefficient. Also 
note that the fact that j + j; + j, is an integer helps to simplify the phase. The remaining 


f — Pi) (2)! 


= (27 +1) Whe 
O ee Pll! 


(10.475) 
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coefficients then become 
h in | J 
m m,|m 


= O mm, +m, A Lis. J» j] (cima 


(es Dti O 
(i+ A+ in + WEA +m)! (A, =m)! (ja +m)! (ja =m)! 
or em Ys om 
v(j-m-y)!(j-j +j- y)! (j -jy + mv)! 


(10.476) 


where the triangle test 


lli-=il<sisjt+i 
Alj jn J| = (10.477) 
ide il l otherwise 
denotes the selection rule for j. Notice that all CG coefficients are real. 
Explicit formulas without summation can be found for small angular momenta in stan- 
dard compilations. Those references also provide useful symmetry relations for CG coef- 
ficients and related quantities, but we will not pursue those details here. 


10.7.7 Spherical Tensors 


Operators T, „ that transform among themselves under rotations as components of an irre- 
ducible representation of the rotation group, such that 


j , 
RE, EAS Y T yy Dit mR] (10.478) 


jm 
m=-j 


are known as spherical tensors of rank j. We also require phases to ensure that complex 
conjugation properties are the same as those for spherical harmonics, such that 


Tin = EL (10.479) 
and we define the scalar product of two spherical tensors according to 
j j 
AB = >) Ay Bim = >, O AB (10.480) 


m=-j m=—j 


Similarly, we define tensor products based upon coupling of angular momentum states 
according to 


4,08),,- ie 2 


m,m, 


J 
A (10.481) 


m 
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Matrix elements of spherical tensors are expressed most simply using the Wigner—Eckart 
theorem in the form 


ele J l , 
Gm, Ein Liem) = e H Ja, ITI jò (10.482) 


Jy 
My 
where the reduced matrix element (j IIT, j;) is independent of magnetic quantum numbers 
and can be evaluated using whatever choices for those quantum numbers that prove sim- 
plest. Obviously, these matrix elements vanish unless |j -— jlsjsj ptj;andm=m;-m,; 
satisfy the selection rules for angular momentum coupling. 

For infinitesimal rotations we expand 


R > 1 -ich-J + Ole] = RT, „R! > T,,, —ieñ-[J,T, ] (10.483) 


Jm Jm Jm 


where fi is a unit vector and express the rotation matrix in terms of matrix elements of J as 


DY nIR] = (j, m | Explieñ-] 1 jm) > 5,9 = (G, m lieh- Ìlj, m) (10.484) 
to obtain commutation relations 
IJ, Tn] = D T, w(i m AT j, m) (10.485) 


between the components of the angular momentum operator and those of the spherical 
tensor. Thus, we obtain commutation relations 


Lo, Tal = mT, » (10.486) 
Mo Timl = Timer iG + 2) =múm + 1) (10.487) 


for spherical tensors that are often easier to verify than their transformation properties. 
Naturally, we find FA Tool = 0 for scalar operators. 

For example, the appropriate linear combinations of a cartesian vector should trans- 
form as a spherical tensor of rank 1. To identify those combinations, we express cartesian 
coordinates in terms of spherical harmonics as 


An 1 
z = Sin[9] Cos[¢] = ,| 3 göl $1-Y, 10, 41) (10.488) 
? = Sin[6] Sin[] = | TE AE (10.489) 
r = = 3 V2 1,-1L%> LiL i 


: = Cos[8] = ,| Ey, olb g] (10.490) 
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Thus, it is useful to define spherical unit vectors as 


1 
è =-= +5) (10.491) 
V2 
1 
ê =—(%-i§) (10.492) 
1 V2 
e, =2 (10.493) 


with the same conjugation property 
en = EME (10.494) 


as spherical harmonics and with the orthonormality condition 


nw lêm = E * Em = Omm (10.495) 
Spherical components of a cartesian vector are then obtained using 

Tin = Ên 17) (10.496) 
such that 

rı = (e, |P) = -x + iy = -Eman $] (10.497) 


4 
r,=(@,|Pextiy=-] = rY, 116, 9] (10.498) 


4 
ry = |? =2=,| = rY, of, 0] (10.499) 


Thus, 


i a 
2 4r a 4r n 
r=r 3 y e a NE Y, ` €] (10.500) 
m=-1 
expresses the basic cartesian vector in a spherical basis using polar coordinates. 
The angular momentum operator J is itself a spherical tensor. Thus, we can express its 
components in the form 


R 1 1 
Ja = @,, Ð =-=, + iJ) = - I. (10.501) 
+1 +1 V2 y V2 + 
so | 1 
Ja= (ê lD) = =U, - ,) = 4 (10.502) 
! 2 ” y2 
h= lJ) =J, (10.503) 


where one must not confuse the raising and lowering operators, J, and J_, with the spher- 
ical components, J,, and J_,, despite their notational similarities. Matrix elements then 
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take the form 


G m (Ja Liem) = FY G F mG £m + 1)/2 by a (10.504) 
(j, w | Jol j, m) = mô, (10.505) 


m,m 


and we can use the special value 
e = _ (10.506) 


j A 
Eee ViG*D 


to deduce the reduced matrix element 


(7 05,0 i) = 5, jG (10.507) 


where u = 0,+1. Thus, the commutation relations for a spherical tensor can now be 
expressed in the form 


Se res E 
Mi Ta = JU ES Je u 


Finally, if T, „ = V„ is a vector operator with j = 1, we can express these commutation 


relations in cartesian form as 


2 7 H Ts (10.508) 


Y, A = le, 


A (10.509) 


where the summation convention is employed and i, j,k € {x, y, z}. Notice that this rela- 
tionship contains the SU(2) commutation relations as a special case. Although we leave 
the algebra to the exercises, this form is very helpful in manipulating operators composed 
of products of 7, p, and L. 


10.8 Unitary Symmetries in Nuclear and Particle Physics 


No discussion of group theory in physics would be complete without at least mentioning 
the assignment of elementary particles to multiplets and the dynamical consequences of 
their associated group symmetries. The most obvious multiplets are those associated with 
isospin symmetry in nuclear physics. The difference between the masses of the proton 
and neutron is so small, only 0.14%, that they can be considered to be two states of the 
same particle, the nucleon, that are degenerate with respect to the strong interaction while 
the small splitting is attributed to the much weaker electromagnetic interaction. Thus, the 
nucleon (N) is assigned an internal variable, called isospin , that is analogous to spin. 
We assume that isospin states transform according SU(2) with generators T, satisfying the 
commutation relations 


[T, T,] = is, 


pl; ij dz (10.510) 
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and that the proton and nucleon belong to a dimension-two representation with 


T?IN) = ¿IN) (10.511) 
T;|p) = +31p) (10.512) 
Tin) = —3|n) (10.513) 


Similarly, the three charge states of the pion (a+, 2°, 117) are practically degenerate and 
are assigned to a dimension three representation of SU(2) isospin. The most prominent 
feature of low-energy interactions between pions and nucleons is a strong resonance in the 
1 = 1 partial wave with four charge states labeled A++, A+, A°, and A” that are assigned 
isospin 3. Expressing the charge states 


af >) (10.514) 


ep) ie 
a la Tce 
pr) =f $92) - f2 [2 40516 
i pag] a (10.517) 
|n) = = 2272). a (10.518) 


3/2 
Be (10.519) 


nn) = | 
in terms of isospin eigenstates and assuming that the isospin > contributions to elastic 1N 
scattering are negligible at the resonance, isospin symmetry predicts that cross-sections 
for the charge states will be found in the ratios 


T [z* p] Co [p] ¿ola pl=ol[ar ahe [x°n| LO [a*a] =9:2:1 (10.520) 


and data for the resonance region are indeed consistent with these ratios. Isospin symmetry 
also plays a central role in the structure of atomic nuclei. 

At a deeper level, isospin symmetry is understood to arise from the flavor symmetry of 
the interaction between quarks and the near degeneracy between the masses of the lightest 
quarks, called up and down (u or d) based upon the older isospin up or isospin down ter- 
minology. It is natural to try to extend this symmetry to include also strange baryons con- 
taining one or more strange quarks even though the s quark is significantly more massive 
than the u or d quarks. The mass degeneracy within the resulting SU(3) multiplets is less 
precise, but SU(3) flavor symmetry does help to explain many of the properties of baryons 
with useful accuracy. We will not discuss the elegant methods that have been developed 
for analyzing the structure of direct-product representations of SU(N), but merely quote a 
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Figure 10.4. Weight diagrams for baryons assigned to symmetric octet (left) and decuplet (right) 
representations of SU(3) flavor symmetry. 


couple of results. The coupling of three quarks, each belonging to SU(3), can be decom- 
posed into irreducible representations according to 


38393=108,508y, 010 (10.521) 


where the singlet is fully antisymmetric, one octet is symmetric and the other antisymmet- 
ric with respect to exchange of the first two quarks, and the decuplet is symmetric with 
respect to the exchange of any pair of quarks. The mixed-symmetry representations (MS 
and MA) can be constructed by coupling two quarks in either symmetric or antisymmet- 
ric configurations and then coupling a third. The singlet is identified with the A(1405) 
particle, while the other low-lying baryons are assigned to multiplets represented by the 
accompanying weight diagrams. The hypercharge Y is defined so that the electric charge 


is Q = T, + Y/2. Each member of 8,, s has spin and parity i = ue while the decuplet has 


JP = 3", 

This model of the baryon spectrum was called the eightfold way by Gell-Mann, co- 
inventor of the quark model with Zweig, because the SU(3) group has eight generators. 
The prediction and subsequent discovery of the O” was a major success of the early quark 
model. Even when symmetries apply only to one sector of a model and are broken by 
another, group theoretical analyses often explain systematics well without detailed theories 
or calculations of the underlying dynamics. Subsequent developments in particle theory 
have relied heavily on such techniques, but are beyond the scope of this text. 


Problems for Chapter 10 
1. Which are groups? 


Which of the following sets constitute groups under the stipulated law of combination? 
Justify your answers: 
a) real numbers under multiplication 


b) real numbers under addition 
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c) all complex numbers except 0 under multiplication 
d) rational numbers under multiplication 


e) rational numbers under a * b = a/b. 


2. Groups of order 4 
Prove that groups of order 4 must be abelian and that there are only two distinct groups of 
order 4. 


3. Class period 
Show that all elements of the same class have the same period. 


4. Quaternions 

Just as complex numbers are generalizations of real numbers to two dimensions, quater- 
nions are generalizations of complex numbers to four dimensions. Suppose that q = a + 
bi+cj+dk where a, b, c, d are real numbers and the quaternion basis vectors 1, i, j, k satisfy 
the following multiplication rules 


P=P=k=-1 =k, fi=-k (10.522) 


a) Show that the set {+1, +i, +/, +k} constitute a group, Q, and that the set of all g is an 
algebra. 


b) Determine the classes and construct the character table for this group. 


c) Construct a faithful four-dimensional representation and resolve it into irreducible rep- 
resentations. 


5. Commutator subgroups 

Suppose that a and b are elements of group G. The commutator for ab is defined as the 
element c € G for which c(ab) = ba. Let H consist of all elements of G that can be 
expressed as products of commutators, such that A = c,c,---c,,. Prove that H is an invariant 
subgroup of G. 


6. Prove that factor groups are abelian 

Suppose that H is an invariant subgroup of G. The factor group is defined by F = G/H = 
{H, g,H, g,H,...} where the g, are all elements of G that are not contained in H. Prove 
that F is abelian. 


7. 3d representation of S, 
Construct and verify a faithful three-dimensional representation of $4. 


8. Group of matrices with unit determinant 

Show that the set of all n x n complex matrices with unit determinant is an invariant 
subgroup of the nonsingular complex n x n matrices and that the corresponding factor 
group is isomorphic to the set of complex numbers excluding 0. 


9. Irreps of abelian groups 
Show that every irreducible representation of an abelian group is one-dimensional. 
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10. Symmetries of a square 
Show that the symmetries of a square are a subgroup of S4. Display the group multiplica- 
tion table and determine the classes and proper subgroups. 


11. Rotational symmetries of a tetrahedron 
A regular tetrahedron is a solid that contains four sides that are equilateral triangles. 


a) Show that the rotational symmetries of a tetrahedron are a subgroup of S,. (We exclude 
twists of a single face.) 


b) Display the group multiplication table and determine its classes. 


c) Construct the character table. 


12. Character table for S, 

Construct the character table for S,. (Hint: if you encounter a situation in which there are 
two solutions for one of the columns or rows, you should be able to choose the desired 
solution by considering the two-dimensional representation for elements of order 2.) 


13. Vibrating triangle 
Three equal masses m are connected by three equal springs k forming an equilateral trian- 
gle in its equilibrium configuration. 


a) Construct the potential-energy matrix and demonstrate explicitly that it is invariant with 
respect to S3. 


b) Use group-theoretical arguments to deduce the eigenvalues for small-amplitude vibra- 
tions of this system without solving a secular equation. 


14. Vibrating square with central mass 

Suppose that four masses m occupy the vertices of a square and that another is at the center. 
The corner masses are connected by four springs k and the corners are connected to the 
central mass by four springs kk. 


a) Construct the potential energy matrix for small-amplitude vibrations and verify that it 
is symmetric wrt to the symmetry group of a square. 

b) Use group theory to obtain the vibrational frequencies. (Hint: use Tr[V.V] to distinguish 
between multiple eigenvalues.) 


15. Infinite groups are not necessarily reducible 
We showed that representations of finite groups are equivalent to unitary representations. 
As such, if a representation is partially reducible it is fully reducible to block diagonal 
form. This is not necessarily true for infinite groups. 


a) Show that 
EM] = i ) (10.523) 


for real / is a representation of a Lie group and identify the group. 
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b) Show that L[7] cannot be reduced and is not unitary. 


16. Monomial representations of translation group 
Show that the set P,,, = {1,x,---,x”} provides a representation basis of the translation 
group T[a]x = x + a and determine its transformation matrices. Verify explicitly that these 
transformation matrices satisfy the group multiplication law T[a]T[b] = T[a + b]. Is this 


representation reducible? Is it unitary; if not, why not? 


17. Time evolution operator for two-state system 
A two-state system has the Hamiltonian 


H = Hl- pò -B (10.524) 


where A, is constant, yz is the magnetic moment, and B is a constant magnetic field. The 
easiest method to evaluate the eigenvalues and eigenvectors is to choose the quantization 
axis parallel to B, but the analysis in terms of an arbitrary direction provides valuable 
practice with matrix exponentiation and the use of rotation matrices. 


a) Use matrix exponentiation to evaluate the time evolution operator for arbitrary B. 

b) If the field were along the z-axis, we would write the eigenvectors as (1,0) and (0,1) with 
respect to that axis. For an arbitrary direction, we should be able to obtain the eigenvec- 
tors using the spin-3 rotation matrix. Verify that states constructed in that manner are 
indeed eigenstates of H. 


18. SO(4) 
The group SO(4) consists of all orthogonal matrices in four dimensions that have unit 
determinant, thereby leaving the Euclidean metric xq + x9 +6 + x4 invariant. 


a) Show that any matrix belonging to SO(4) can be expanded in the form 


3 
M = Expli ) (a,A; + b;B;) (10.525) 
i=l 
where the nonvanishing elements of the basis matrices are 
(Ai), = Hier 1S AK <3 (10.526) 
(B,), =i (10.527) 
(B); =i (10.528) 


and where the coefficients a,, b; are real numbers. Evaluate the commutation relations 
for this basis. 


b) Show that the six generators for the associated Lie group of transformations acting upon 
these coordinates take the form 


(4,47, Ay, By, By, B3} = {Ap 3, A3, p 41,2 41,4 424 434) (10.529) 


ð 0 
A; j =-] h2 115 (10.530) 
j i 
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and satisfy the following commutation relations. 


[Ai Aj] = ie, ¿Ay (10.531) 
[A; B,] = i£; ; By (10.532) 
|B; B,] = 18; ¡As (10.533) 


c) Show that the linear transformation 


J,=4(A, +B) (10.534) 


K, = 1 (A; - B,) (10.535) 


1 


Il 


allows one to decompose SO(4) = SO(3) € SO(3) into two invariant subalgebras and 
deduce the commutation relations for the new operators. What is the rank of SO(4)? Is 
it simple? Is it semisimple? 

d) Determine the Casimir operators for SO(4). Evaluate the corresponding matrices for 
coordinate transformations. 


19. Lorentz group in one spatial dimension 
A Lorentz boost B[v] performs the coordinate transformation 


t = y(t + vx) (10.536) 
x’ = y(x + vt) (10.537) 


where y = (1 —v*)"!?. 


a) Show that B[v] is a Lie group and deduce its law of composition. Interpret this result. 


b) Deduce the infinitesimal generator K, evaluate the finite transformation Exp[i@K] in 
closed form, and determine the relationship between « and v. 


c) Construct the operator for transformation of functions f[t, x] and discuss its superficial 
resemblance to the generator for rotations. How is it related to the Galilean transforma- 
tion? 


20. Lorentz group 

a) Construct matrices K, for infinitesimal boosts and L; for infinitesimal rotations of a 
Lorentz four-vector. Then evaluate the structure constants for the Lorentz group and 
verify compliance with the Jacobi identity. What is the rank of this group? It is important 
to recognize that these matrices comprise a particular representation of the abstract 
group that is defined in terms of their commutation relations, which are more general. 


b) Evaluate the metric and show that the Lorentz group is semisimple. Express the Casimir 
operator C, in terms of L and K and obtain the corresponding matrix for this particular 
representation. 


c) Show that there are linear combinations of L and K that allow the Lie algebra to be 
separated into two disjoint subalgebras; in other words, show that commutation relations 
for each subalgebra are closed and do not involve operators from the other subalgebra. 
Then construct the Casimir operators for each subalgebra. 
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21. Alternative representation of S for s = 1 

The infinitesimal generators derived in the text for the s = 1 representation of SU(2) 
do not diagonalize S}. Find a similarity transform that diagonalizes $}, with decreasing 
eigenvalues on the diagonal, and makes S, real. Then verify that the proper commutation 
relations are obtained in the new basis. 


22. SU(3) 
SU(3) matrices are conventionally parametrized in the form 


U = Exp|-56n i] (10.538) 


where O is a real number, fi is a real unit vector that represents an axis in the internal 
coordinate space, and 


01 0 0 -i 0 1 0 0 
a=|1 0 ol, a=li 0 Of, a =|o -1 ol, (10.539) 
0.00 0.0.0 0 0 0 
001 0 0 =i 00 0 
a =|o o ol, A =|0 0 0l, 1% =|o0 0 1], (10.540) 
1.00 io o 010 
00 0 0 0 
4,=]0 0 -il, == 1 0 (10.541) 
0 i 0 V3lo o 2 


are traceless hermitian 3 x 3 matrices. The appearance of Pauli matrices in the upper-left 
blocks of {A,, A), 13) reveals an SU(2) subgroup. 


a) Tabulate the structure constants for this group and verify that they meet the requirements 
of a Lie algebra. 


b) Show that anticommutators can be expressed in the form 


k 
{Aj Aj} = ad; + AA, =0, , + df Ay (10.542) 


P J 
and tabulate df j 


c) Evaluate Tr A;À;. 


23. Baker-Haussdorff identity 
Verify the Baker—Haussdorff identity. 


24. Does orbital angular momentum / = 1/2 exist? 
Show that the ladder operators for orbital angular momentum take the forms 


L, = +Exp[+i@] (5 + ¿Coal 5) (10.543) 
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in polar coordinates. Let the eigenfunctions for a hypothetical system with / = 5 be repre- 
sented in the form 


Wir = ule”? (10.544) 
Wi = viele? (10.545) 


and use the requirements 

L Yin =0 LW. =90 (10.546) 
to obtain first-order differential equations for u[6] and v[@]. Finally, show that 

L_u+av, L,v#bu (10.547) 


where a and b are constants. Therefore, a representation of SO(3) with / = 5 does not 
exist. More generally, one can show that eigenfunctions of L? and L} must have integral /. 


25. Weights for one-dimensional Lorentz transformations 
Recall that the law of composition for the group of one-dimensional Lorentz transforma- 
tions is expressed either as 


vi tv 


v = o [vo v2] = een (10.548) 
for the velocity parametrization or as 
m =| m] =m, +m (10.549) 


in terms of rapidity v = Tanh[ņ]. Evaluate the density function for group integration using 
both parametrizations and then re-express those results in terms of dp/E. 


26. Compare left and right densities 
Suppose that matrices of the form 


e b 
E l (10.550) 


represent a Lie group. Determine the group multiplication laws for {a, b} and then evaluate 
both left and right invariant densities for its parameter space. 


27. Density for Euclidean transformations 
a) The transformations of the Euclidean plane that preserve distances can be parametrized 
by the three-parameter group 


x’ = a+xCos[6] + y Sin[6] (10.551) 
y = b — xSin[0] + y Cos[8] (10.552) 


Evaluate the left and right invariant densities. 
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b) Alternatively, the Euclidean symmetries can be parametrized as 


x’ = pCos[d] + x Cos[6] + y Sin[8] (10.553) 
y = p Sin[ġ] — x Sin[6] + y Cos[6] (10.554) 


Determine the density function for this parametrization. 


28. Density for SU(2) 
a) Show that 


( Po + ip; P2 + a (10.555) 
—P2 +Y1P1 Po 1P3 
with real p, constrained according to po == pi po as E is a parametrization of SU(2) 
with three parameters. Evaluate the left and right densities. 
b) Show that the angular parametrization 


( e Cos[6] el Sin[6] ) 


-e Y Sin[9] e”**Cos[8] ee) 


is also a parametrization of SU(2) and use the preceding result to deduce the corre- 
sponding density without working out the composition laws for these parameters. 


29. Homomorphism between SU(2) and SO(3) 

Here we ask you to use MATHEMATICA’ or other symbolic manipulation software to com- 
plete some of the steps in the demonstration of the homomorphism between SU(2) and 
SO(3). This can be done by hand, of course, but would be tedious. Let 


ds Po+ip; P+ 3 (10.557) 
—P2 t+ lP, Po 1P3 


where the parameters are real and where 
Po =l- pi- P- Pi (10.558) 


ensures unitarity. Also let 


ae Z x-— iy 
p=: T= , Dip En ) (10.559) 
Evaluate p’ = UpU”! and deduce the 3 x 3 matrix A that performs the rotation 7” = Ar. 
Demonstrate that A is orthogonal and that Det[A] = +1. 


30. Rotations using space-fixed or body-fixed axes 
The text describes two methods for parametrizing rotations in terms of Euler angles. The 
first, 


Ria, B, y] = Ry [y]R; BIRLa, (10.560) 


employs a sequence of transformations based upon axes embedded in the physical system, 
described as body-fixed axes. Each transformation produces a new set of coordinate axes. 
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The second, 
Rla, p, y] = R¿la]R, [61R,[y), (10.561) 


describes a sequence of transformations using a common set of axes, described as space- 
fixed axes. Prove that both methods produce the same rotation. 


31. Commutation relations for orbital angular momentum 
Verify that the differential operators for orbital angular momentum satisfy the commuta- 


tion relations [L,, L, jl= = Le, j rlr 


32. Explicit formula for special CG coefficient 
We derived the formula 


A : ap? 
i h| j ) 
Ji =j Ji > Jo 


=e 


End (WEAS 
x DC ye fe j ) (10.562) 
Breve 

T[j+ 3, + +2] 


in the text and would like to perform the summation. Use the identity 


yUIntv+1] (k)_ TIk+m—nin+1] 
LO T[m+v+1] (5) =Tlk+m+1][m-—n] Gruan) 
to obtain 
” 5 5 2 Y) Y) 
i | J ; Pi (2/1)! (2s) (10.564) 
Ji Be | Jv ds (i, + io - ACA, + do +3 +1)! 


33. Integration of three spherical harmonics 
Evaluation of matrix elements of spherical tensors often requires integration of a product 
of three spherical harmonics. Evaluate 


fr, r m (8 91%,,, 18, OY; 10, 61 dO (10.565) 


in terms of Clebsch-Gordan coefficients. Then deduce the reduced matrix element (/ | 
Y, 11,). 


34. Commutation relations for products 

Demonstrate the following commutation relations for products. 
[A, BC] = B[A, C] + [A, B]C (10.566) 
[AB, C] = A[B, C] + [A, C]B (10.567) 
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35. Commutation relations for vectors 
a) Verify the commutation relation 


[Li Vj] = is, ¿Va (10.568) 


where L is the orbital angular momentum and V is an arbitrary vector operator. Thus, 
show that L- V = V - L for any vector operator V. 


b) Verify [L, A- B] = [2?, A- B] = 0 for arbitrary vector operators A and B. Do not assume 
that A and B commute. 


c) Use commutation relations to verify that A x B transforms as a vector under rotations. 
Do not assume that A and B commute. 


d) Next, show that 
VxL+LxV =2iV (10.569) 
for any V. For example, the identity 
?=(PxL+Lx?) (10.570) 


is often useful. 


e) Show that 
|L, V] =i(VxL-LxV)=2(V +iV xL) (10.571) 
for any V. Thus, [L?, L] = 0. 


36. Commutation relations involving 7, p, and L 
Derive each of the commutation relations listed below. Here p = —iV, L = ? x p, f = T/r. 


a) |P, P] = 3 (P+ itx L) = 4(#xL-Lx?) 
b) [PRL 1] = ¿PXL 


gr 
x xXx 
yl 
| = 
œ 

x |l 
D H~ 
"j= 


RN al o 


37. Runge-Lenz vector for hydrogenic atoms: An example of dynamical symmetry 
In this problem we use group-theoretical methods to analyze the spectrum of hydrogenic 
atoms without ever solving a differential equation. The method relies upon a generaliza- 
tion of the Runge—Lenz vector which, for the Kepler problem, points in the direction of 
the major axis of an elliptical orbit. The conservation of the classical Runge—Lenz vector 
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demonstrates that there is a dynamical symmetry for the inverse-square law that leads to 
closed elliptical orbits; small violations of this symmetry in general relativity are respon- 
sible for the famous precession of the perihelion of Mercury. One can show that the n? 
degeneracy of energy level E, in the Bohr model of hydrogenic atoms arises from a similar 
dynamical symmetry described by conservation of a quantum mechanical generalization 
of the Runge—Lenz vector. Here we guide the student through an analysis that is relatively 
straightforward, even if some of the algebra is tedious. 


a) Show that the symmetrized Runge—Lenz vector 
A=a(pxL-Lxp)-? (10.572) 


is hermitian and that it commutes with the hamiltonian 
2 
ey (10.573) 
2u r 
for a suitable choice of « (real). Thus, there is a dynamical symmetry and the full sym- 
metry group is based upon the six generators L and A. 


b) Show that 


2H 
A =1+5 (P +1) (10.574) 

HK 

c) Show that the symmetry operators satisfy the following commutation relations: 
[Lo L;] = i£; Lk (10.575) 
[L, Aj] = ie; ae (10.576) 
2H. 

[A A] =- en Lk (10.577) 


d) The presence of H in the preceding commutation relations shows that {A, L} do not 
form a Lie group with respect to the full Hilbert space. However, if we define new 
Runge—Lenz operators according to 


ES 2H > 
Vou 


where H > E within any subspace with common energy E < 0, the commutators for 
{A, L} do form a closed algebraic system. Demonstrate that the new system is closed. 
Then evaluate the commutation relations for the linear combinations 


j= 3(L+) (10.579) 
K=! (i = 1) (10.580) 
and prove that their structure constants satisfy the conditions required for a Lie group. 
Construct the Casimir operators, identify a complete set of quantum numbers, and spec- 


ify their allowed values. Note that for this system one of the Casimir operators is actu- 
ally redundant. 
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e) Use J? to compute the energy eigenvalues and determine the degeneracy of multiplets. 


38. Landé formula and deuteron magnetic moment 
In this problem you will derive a formula that is often useful in nuclear or atomic spec- 
troscopy and then will apply that formula to analyze the magnetic moment of the deuteron. 


a) Suppose that V is a vector operator. Show that 


(¡miJ-V | jm) 


(JIVI yj) = — (10.581) 
vi +1) 
for j > 0. 
b) The magnetic dipole operator for the deuterium nucleus takes the form 
p = 8p dd 8S + aL (10.582) 


where $ p and S, are proton and neutron spins and where the factor of 5 for the orbital 
angular momentum L arises because only the proton carries charge and it carries half 


the total orbital angular momentum. The ground-state wave function can be expressed 
as 


ly) =a|?s,) +b|*D,) (10.583) 


where a? + b? = 1 and where the standard spectroscopic notation specifies dd: ZE 
Evaluate the magnetic moment, defined by the matrix element 


u= (J, M | uy |J, M) (10.584) 


for the aligned substate with maximum magnetic quantum number. Given that g, = 
5.586uy, 8, = —3.826uy, and uy = 0.8574, where uy is the nuclear magneton, esti- 
mate the D-state probability, b?. Note that this model omits the contributions due to 
meson exchange between nucleons, but still gives a good approximation to the D-state 
admixture in the ground state. 
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